skill-e2e-plan-ceo-mode-routing.test.ts (~$3/run, 6-10 min/case):
- Verifies AUQ answer routing: HOLD SCOPE → rigor/bulletproof posture
language; SCOPE EXPANSION → expansion/10x/dream language. Each case
navigates 8-12 prior AUQs (telemetry, proactive, routing, vendoring,
brain, office-hours, premise, approach) before hitting Step 0F.
- Periodic, not gate: navigation phase too slow for PR-blocking.
V2 expansion to 4 modes (SELECTIVE + REDUCTION) when nav is faster.
skill-e2e-ship-idempotency.test.ts (~$3/run, 5-10 min):
- Builds a real git fixture with VERSION 0.0.2 already bumped, matching
package.json, CHANGELOG entry, pushed to a local bare remote. Runs
/ship in plan mode and asserts STATE: ALREADY_BUMPED echoes from the
Step 12 idempotency check, OR plan_ready terminates without mutation.
- Snapshots VERSION + package.json + CHANGELOG entry count + commit
count + branch HEAD before/after; fails if any changed.
skill-e2e-autoplan-chain.test.ts (~$8/run, 12-18 min):
- Asserts /autoplan phases run sequentially: tees timestamps as each
"**Phase N complete.**" marker first appears. Phase 1 (CEO) must
precede Phase 3 (Eng); Phase 2 (Design) is optional but if it
appears, must sit between 1 and 3.
- Auto-grants permission dialogs that fire during phase transitions.
All three auto-handle permission dialogs (preamble side-effects on
fresh user envs without .feature-prompted-* markers).
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>