Commit Graph

  • 9e244c0bed v1.11.1.0 fix: plan-mode handshake + canUseTool test harness (#1182) Garry Tan 2026-04-24 00:04:53 -07:00
  • d52da0bd08 test(setup-gbrain): unit tests for gstack-gbrain-supabase-provision via mock API Garry Tan 2026-04-24 00:01:18 -07:00
  • a8cc0d2465 feat(setup-gbrain): add gstack-gbrain-supabase-provision Management API wrapper Garry Tan 2026-04-24 00:01:18 -07:00
  • 0e97293e43 chore(v1.11.1.0): VERSION bump + CHANGELOG entry + TODOS follow-ups garrytan/fix-ceo-no-askuser Garry Tan 2026-04-23 23:44:11 -07:00
  • d56d5cd947 test(setup-gbrain): unit tests for supabase-verify + lib.sh secret helper Garry Tan 2026-04-23 23:42:07 -07:00
  • bee4e661ea feat(setup-gbrain): add gstack-gbrain-supabase-verify structural URL check Garry Tan 2026-04-23 23:42:07 -07:00
  • 796e1df47a feat(setup-gbrain): add gstack-gbrain-lib.sh with read_secret_to_env (D3-eng) Garry Tan 2026-04-23 23:42:07 -07:00
  • fb7558b518 Merge remote-tracking branch 'origin/main' into garrytan/fix-ceo-no-askuser Garry Tan 2026-04-23 23:41:27 -07:00
  • d46a83b2e1 test: plan-mode handshake E2E coverage and unit assertions Garry Tan 2026-04-23 23:41:13 -07:00
  • 28b14fbf0c feat: extend agent-sdk-runner with canUseTool for AskUserQuestion interception Garry Tan 2026-04-23 23:40:50 -07:00
  • 5e4895c90a feat: plan-mode handshake for interactive review skills Garry Tan 2026-04-23 23:40:36 -07:00
  • a767c3afec test(setup-gbrain): unit tests for gstack-gbrain-detect + install Garry Tan 2026-04-23 23:38:42 -07:00
  • 81b768109e feat(setup-gbrain): add gstack-gbrain-install with D5 detect-first + D19 PATH-shadow guard Garry Tan 2026-04-23 23:38:42 -07:00
  • 2659be2ad9 feat(setup-gbrain): add gstack-gbrain-detect state reporter Garry Tan 2026-04-23 23:38:42 -07:00
  • 637f5e37cf test(setup-gbrain): unit tests for gstack-gbrain-repo-policy Garry Tan 2026-04-23 23:34:03 -07:00
  • 61bcc2d450 feat(setup-gbrain): add gstack-gbrain-repo-policy bin helper Garry Tan 2026-04-23 23:33:37 -07:00
  • e4041f7a7f v1.11.0.0 feat(ship): workspace-aware version allocation (#1168) Garry Tan 2026-04-23 23:03:27 -07:00
  • b85752ddea Merge remote-tracking branch 'origin/main' into garrytan/workspace-aware-ship garrytan/workspace-aware-ship Garry Tan 2026-04-23 22:17:30 -07:00
  • 118331aa6b ci: re-trigger PR workflows after merge Garry Tan 2026-04-23 21:25:41 -07:00
  • a64d70ba35 Merge remote-tracking branch 'origin/main' into garrytan/workspace-aware-ship Garry Tan 2026-04-23 21:20:25 -07:00
  • e3d7f49c74 feat(v1.10.1.0): overlay efficacy harness + Opus 4.7 fanout nudge removal (#1166) Garry Tan 2026-04-23 18:42:58 -07:00
  • 0a4c4e69ba Merge remote-tracking branch 'origin/main' into garrytan/overlay-fanout-eval garrytan/overlay-fanout-eval Garry Tan 2026-04-23 18:27:06 -07:00
  • a81be53621 v1.10.0.0: fix AskUserQuestion cadence + Pros/Cons format upgrade (#1178) Garry Tan 2026-04-23 18:25:34 -07:00
  • 23cd76d463 Merge remote-tracking branch 'origin/main' into garrytan/overlay-fanout-eval Garry Tan 2026-04-23 18:08:54 -07:00
  • cc50f101f7 Merge remote-tracking branch 'origin/main' into garrytan/PRIORITY-broken-ask-user-question garrytan/PRIORITY-broken-ask-user-question Garry Tan 2026-04-23 18:08:00 -07:00
  • 384a5667d6 chore: bump version to 1.10.1.0 Garry Tan 2026-04-23 18:03:48 -07:00
  • 4827e7d672 v1.10.0.0: bump VERSION (was v1.7.0.0, align with branch discipline) Garry Tan 2026-04-23 18:03:06 -07:00
  • 9dbaf906cf feat(v1.9.0.0): gbrain-sync — cross-machine gstack memory (#1151) Garry Tan 2026-04-23 17:54:54 -07:00
  • 09997b7932 chore: bump to v1.9.0.0 for gbrain-sync landing garrytan/gbrain-support Garry Tan 2026-04-23 17:54:23 -07:00
  • 5f038ab762 v1.7.0.0: plan reviews walk you through each issue with Pros/Cons Garry Tan 2026-04-23 17:51:22 -07:00
  • c8d87289b1 test: regenerate golden fixtures + update ELI10 phrase check for v1.7.0.0 Garry Tan 2026-04-23 17:48:30 -07:00
  • 8d57df27a8 test(eval): Sonnet 4.6 variants of the 5 Opus-4.7 fixtures Garry Tan 2026-04-23 17:43:38 -07:00
  • d06f08938f test: gate-tier units + periodic Pros/Cons evals for AskUserQuestion format Garry Tan 2026-04-23 16:48:10 -07:00
  • 6b99df9df7 feat(preamble): upgrade AskUserQuestion format to Pros/Cons decision brief Garry Tan 2026-04-23 16:41:36 -07:00
  • d63b4cd0e0 fix(plan-reviews): tighten STOP/escape-hatch directives across 4 templates Garry Tan 2026-04-23 16:38:21 -07:00
  • cb3713fbf1 fix(preamble): reorder AskUserQuestion Format above model overlay + rewrite Opus 4.7 pacing directive Garry Tan 2026-04-23 16:35:26 -07:00
  • a9a3ac6d2a test(eval): bump maxTurns to 15 for claude-dedicated-tools-vs-bash Garry Tan 2026-04-23 11:34:51 -07:00
  • 5294c65777 fix(eval): handle SDK max-turns throw gracefully Garry Tan 2026-04-23 11:34:51 -07:00
  • 416a56a5c8 fix(ship): exclude current PR from queue-awareness (self-reference bug) Garry Tan 2026-04-23 11:28:02 -07:00
  • bd39b6995f chore: bump version and changelog (v1.8.0.0) Garry Tan 2026-04-23 11:12:42 -07:00
  • 6efd6483e8 Merge remote-tracking branch 'origin/main' into garrytan/workspace-aware-ship Garry Tan 2026-04-23 11:08:17 -07:00
  • cb722a7fa7 docs: versioning invariant in CLAUDE.md Garry Tan 2026-04-23 11:08:12 -07:00
  • d7fa332803 feat(skill): /landing-report read-only queue dashboard Garry Tan 2026-04-23 11:08:12 -07:00
  • f4ec92341c feat(skills): queue-aware /ship + drift abort in /land-and-deploy + advisory in /review Garry Tan 2026-04-23 11:08:12 -07:00
  • 04e2f1bea9 test(eval): 3 more overlay fixtures to measure remaining Claude nudges Garry Tan 2026-04-23 11:08:09 -07:00
  • c46388ac4d feat(ci): version-gate + pr-title-sync workflows (GitHub + GitLab) Garry Tan 2026-04-23 11:07:57 -07:00
  • c7bdfdf304 feat(scripts): detect-bump + compare-pr-version helpers Garry Tan 2026-04-23 11:07:56 -07:00
  • 236e9d91cc test: fixture tests for gstack-next-version Garry Tan 2026-04-23 11:07:56 -07:00
  • 7ed110e57b feat: bin/gstack-next-version util + workspace_root config key Garry Tan 2026-04-23 11:07:46 -07:00
  • 76829b76dc feat(eval): extend OverlayFixture with allowedTools, maxTurns, direction Garry Tan 2026-04-23 11:07:37 -07:00
  • f3b40b12d7 Merge remote-tracking branch 'origin/main' into garrytan/overlay-fanout-eval Garry Tan 2026-04-23 10:54:42 -07:00
  • 8a3f197aea chore: bump version and changelog (v1.6.5.0) Garry Tan 2026-04-23 10:37:25 -07:00
  • 54d3ad923d Merge branch 'main' into garrytan/gbrain-support Garry Tan 2026-04-23 10:36:30 -07:00
  • d75402bbd2 v1.6.4.0: cut Haiku classifier FP from 44% to 23%, gate now enforced (#1135) Garry Tan 2026-04-23 10:23:40 -07:00
  • bfb8ac3b7f Merge remote-tracking branch 'origin/main' into garrytan/overlay-fanout-eval Garry Tan 2026-04-23 09:14:16 -07:00
  • cb5f074d7c fix(opus-4.7): remove "Fan out explicitly" overlay nudge Garry Tan 2026-04-23 09:14:11 -07:00
  • 546404c81f test: register overlay harness in touchfiles (both maps) Garry Tan 2026-04-23 09:13:59 -07:00
  • e432f4bd94 test(eval): paid periodic overlay-efficacy harness Garry Tan 2026-04-23 09:13:58 -07:00
  • 06a862faab test(eval): unit tests for agent-sdk-runner (36 tests, free tier) Garry Tan 2026-04-23 09:13:58 -07:00
  • 6b85262422 feat(eval): parametric overlay-efficacy harness (runner + fixtures) Garry Tan 2026-04-23 09:13:58 -07:00
  • 66887b2f05 feat(preflight): sanity check for agent-sdk + overlay resolver Garry Tan 2026-04-23 09:13:37 -07:00
  • 27e4ee7498 chore: add @anthropic-ai/claude-agent-sdk@0.2.117 dep Garry Tan 2026-04-23 09:13:37 -07:00
  • 6c6fa69191 refactor: export readOverlay from model-overlay resolver Garry Tan 2026-04-23 09:13:37 -07:00
  • c7d6add473 fix(test): session-awareness reads AskUserQuestion Format from a Tier 2+ SKILL.md Garry Tan 2026-04-23 09:10:59 -07:00
  • f43369a453 docs(changelog): rewrite v1.6.4.0; strip process minutiae garrytan/injection-tuning Garry Tan 2026-04-23 09:09:42 -07:00
  • aa6d69a0a8 docs(changelog): strip process minutiae from entries; rewrite v1.6.4.0 Garry Tan 2026-04-23 08:29:38 -07:00
  • a882c05151 docs(changelog): add v1.6.4.0 placeholder entry at top Garry Tan 2026-04-23 07:34:09 -07:00
  • 8407b4930b Merge branch 'main' into garrytan/gbrain-support Garry Tan 2026-04-23 07:30:58 -07:00
  • f94ee63460 merge: origin/main into garrytan/injection-tuning; bump v1.6.2.0 → v1.6.4.0 Garry Tan 2026-04-23 07:28:41 -07:00
  • 69733e2622 fix(plan-reviews): restore RECOMMENDATION + Completeness split + Codex ELI10 (v1.6.3.0) (#1149) Garry Tan 2026-04-23 07:25:20 -07:00
  • 62fa71962c chore: bump version and changelog (v1.6.3.0) garrytan/plan-review-regressions Garry Tan 2026-04-23 07:09:13 -07:00
  • 09c82222ea test: fix Codex eval sandbox + collector API Garry Tan 2026-04-22 22:02:32 -07:00
  • 028627fbcd fix(preamble): harden AskUserQuestion Format + Codex ELI10 carve-out Garry Tan 2026-04-22 21:34:31 -07:00
  • b7f6246061 test: add Codex eval for AskUserQuestion format compliance Garry Tan 2026-04-22 21:34:11 -07:00
  • 12b29c7518 merge: origin/main into garrytan/injection-tuning; bump v1.5.2.0 → v1.6.2.0 Garry Tan 2026-04-22 17:00:33 -07:00
  • 756525100c chore: regenerate SKILL.md files for gbrain-sync preamble block Garry Tan 2026-04-22 13:58:41 -07:00
  • c3f73f91d4 chore: bump version and changelog (v1.7.0.0) Garry Tan 2026-04-22 13:47:12 -07:00
  • 91c734a6af docs(gbrain-sync): user guide + error lookup + README section Garry Tan 2026-04-22 13:47:12 -07:00
  • c064743eda test(gbrain-sync): 27-test consolidated suite Garry Tan 2026-04-22 13:47:12 -07:00
  • a2aa8a07d4 feat(gbrain-sync): preamble block — privacy gate + boundary sync Garry Tan 2026-04-22 13:47:12 -07:00
  • f088fe96f8 feat(gbrain-sync): init, restore, uninstall, consumer registry Garry Tan 2026-04-22 13:47:11 -07:00
  • 97cbacf409 feat(gbrain-sync): --once drain + secret scan + push Garry Tan 2026-04-22 13:47:11 -07:00
  • 45638297ba feat(gbrain-sync): queue primitives + writer shims Garry Tan 2026-04-22 13:47:11 -07:00
  • d591ad29b2 chore: bump version and changelog (v1.6.2.0) Garry Tan 2026-04-22 12:32:14 -07:00
  • 00e8a8599c Merge remote-tracking branch 'origin/main' into garrytan/plan-review-regressions Garry Tan 2026-04-22 12:29:35 -07:00
  • 5fe1814310 fix(plan-reviews): restore RECOMMENDATION + split Completeness by question type Garry Tan 2026-04-22 01:11:05 -07:00
  • 6a46b9099f test: add AskUserQuestion format regression eval for plan reviews Garry Tan 2026-04-22 01:10:35 -07:00
  • 656df0e37e feat(v1.5.2.0): Opus 4.7 migration — model overlay, voice, routing (#1117) Garry Tan 2026-04-22 01:06:22 -07:00
  • 03322352dd test(opus-4.7): key touchfile entries by testName, not describe text feat/opus-4.7-migration Garry Tan 2026-04-22 00:37:27 -07:00
  • 206bf93390 chore(release): v1.6.1.0 — Opus 4.7 migration, reviewed Garry Tan 2026-04-22 00:30:38 -07:00
  • 404c9c925d docs(todos): verify Opus 4.7 fanout nudge in Claude Code harness (P0) Garry Tan 2026-04-22 00:28:54 -07:00
  • d646bc12d8 test(opus-4.7): rewrite scratch-root helper + add afterAll cleanup Garry Tan 2026-04-22 00:27:53 -07:00
  • 723f9957f2 test(opus-4.7): tighten ambiguous /qa routing prompt Garry Tan 2026-04-22 00:27:28 -07:00
  • 36ef9d9db0 refactor(opus-4.7): rewrite fanout nudge to show parallel tool_use pattern Garry Tan 2026-04-22 00:26:26 -07:00
  • 7e90b0f092 test(opus-4.7): E2E eval for fanout rate + routing precision Garry Tan 2026-04-22 00:11:38 -07:00
  • d3742c884a test(team-mode): give setup -q / setup --local tests a 3-minute budget Garry Tan 2026-04-21 23:48:48 -07:00
  • b6be59ab75 test(binary-guard): replace xargs-per-file loops with fs.statSync + mode filter Garry Tan 2026-04-21 23:43:43 -07:00
  • b90bc40295 test(routing): assert slash-prefixed skills + new policy + current names Garry Tan 2026-04-21 23:41:31 -07:00
  • 6701205118 chore(opus-4.7): regenerate SKILL.md files + update golden fixtures Garry Tan 2026-04-21 23:40:07 -07:00
  • da75ebaaa0 refactor(opus-4.7): split overlay, align routing, fix trailer fallback Garry Tan 2026-04-21 23:39:42 -07:00