aidokitwiki

2026-05-26 — All 25 pain points closed by v1.x

Thirteen feature branches merged to main this week. The 25-point pain analysis that drove the v1.x roadmap is now fully addressed in code. This post is the ledger — what landed, where to look, what's still deferred, and what we are not claiming.

The 25-point analysis, briefly #

Earlier this cycle we sat down with the Hall of Pain taxonomy and the field reports we have accumulated and produced a flat list of 25 distinct failure modes that unsupervised AI coding agents reliably trip over: context window limits, context poisoning, hallucinated symbols, token cost explosions, stale docs, runaway loops, secrets leakage, license contamination, prompt injection via MCPs, and so on. Each got an owner mechanism. Most were already covered at Standard or Strict tier; the rest are what shipped this week.

What shipped — sub-project ledger #

Thirteen branches, all merged to main and reflected in CHANGELOG.md [Unreleased]:

Adapter cross-cutting #

All of the above works identically across the three first-party adapters (claude-code, codex, copilot). The byte-compare dogfood gate runs for all three — packages/cli/test/fixtures/v4-reference/, packages/cli/test/fixtures/codex-reference/, and packages/cli/test/fixtures/copilot-reference/ all updated together from the same reference-context.json.

Three skills are now always-on per ADR-0020, emitted at every conformance tier regardless of the adapter:

computeFilePlan dedupes the always-on set against the normal tier-gated BASE_SKILLS list by id, so an adapter that already emits one of these at its declared tier does not get a duplicate.

What is deferred #

Wave 0 (refactor/doctor-check-registry) — a refactor of packages/cli/src/commands/doctor.ts into a check-registry pattern. It was queued first but not merged, because SP6, SP8, SP9, and SP10 each added inline doctor checks against the pre-refactor shape and we did not want to re-architect under live patches. Planned follow-up: consolidate the inline checks (MCP_QUARANTINE_SKILL_MISSING, the --model-drift group, the --hygiene group, and the semantic doc-drift pass) into the registry pattern in a single PR.

What now exists in the wiki #

What is gating GA (unchanged) #

The engineering surface is now done in scope; what remains is human work outside the codebase:

We will not ship v1.0 GA until those land — but for the first time since the analysis was written, no engineering blocker stands in front of them.