mirror of
https://github.com/garrytan/gstack.git
synced 2026-06-02 00:01:37 +02:00
v1.53.1.0 fix: non-interactive-safe plan-tune hook install (flags + smart defaults) (#1805)
* feat(config): add plan_tune_hooks setting (prompt|yes|no) Registers a new gstack-config key controlling whether ./setup installs the plan-tune Claude Code hooks. Default "prompt". Documented in the config header and surfaced in `gstack-config defaults` / `list`. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(setup): make plan-tune hook install non-interactive-safe The plan-tune consent prompt used a blocking `read -r` with no timeout. Under a forwarded/automated TTY (conductor workspace setup, CI with a pty) it hung setup forever. Move the decision into flags + env + saved config with a smart default: --plan-tune-hooks / --no-plan-tune-hooks / --plan-tune-hooks=yes|no|prompt > GSTACK_PLAN_TUNE_HOOKS env > plan_tune_hooks config > prompt-on-real-TTY. Explicit yes/no act non-interactively. The remaining interactive branch is gated on a real (non-quiet) TTY and uses a time-bounded `read -t 10 </dev/tty` that defaults to skip, so it can never hang. A timeout no longer persists a decline marker, so a later hands-on run can still offer the install. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(dev-setup): run setup non-interactively in dev/workspace mode Conductor runs bin/dev-setup under a forwarded pty, so any setup prompt (skill-prefix, plan-tune consent) would hang the workspace. Detach stdin (`setup </dev/null`) so every prompt takes its smart non-interactive default: flat skill names, skip the global plan-tune hook install without writing a decline marker. Saved prefix/config preferences are still honored, and a dev workspace no longer silently mutates ~/.claude/settings.json. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(setup): guard plan-tune hooks stay non-interactive Static + binary-level regression test (free, <1s): asserts the flags are wired, the plan-tune read is time-bounded (no bare blocking read), explicit yes/no decisions short-circuit before the prompt, and gstack-config knows the plan_tune_hooks key. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(setup,config): harden plan-tune decision against bad input Review follow-ups to the non-interactive plan-tune work: - setup now lowercases + whitespace-strips the resolved decision before the case match, so an explicit opt-in via flag/env ("YES", "Yes", " yes") is honored instead of silently falling through to "prompt"/skip. Also accepts on/off and 1/0. - gstack-config rejects out-of-domain plan_tune_hooks values (anything but prompt|yes|no) with a warning + fallback to prompt, matching the existing value-whitelist pattern for explain_level / artifacts_sync_mode. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(dev-setup): never mutate global hooks during workspace setup Closing stdin alone only suppresses the prompt branch; a saved `plan_tune_hooks: yes` or exported GSTACK_PLAN_TUNE_HOOKS=yes would still resolve to "install" and rewrite the user's global ~/.claude/settings.json to point at THIS ephemeral worktree — which breaks once the workspace is deleted. Pass --plan-tune-hooks=prompt (highest precedence) so dev-setup pins resolution to prompt-mode; with stdin closed that is a guaranteed no-op skip (no install, no decline marker). To install the hooks, run ./setup --plan-tune-hooks directly. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(setup): isolate config tests from host + cover new guards - Point gstack-config tests at a temp GSTACK_HOME so `get plan_tune_hooks` reads the built-in default, not whatever the host machine has in ~/.gstack/config.yaml (the prior test was non-deterministic). - Add behavioral coverage: yes/no/prompt round-trip, out-of-domain rejection. - Add a normalization guard (decision input is lowercased/trimmed) and a dev-setup guard (runs setup with --plan-tune-hooks=prompt + stdin detached). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: rebaseline parity-suite v1.44.1 -> v1.53.0.0 The frozen v1.44.1 anchor went stale: five planning skills (plan-ceo-review, plan-eng-review, plan-design-review, investigate, office-hours) crept past the 1.05x ceiling via legitimate v1.49-v1.53 growth (brain-aware planning + the v1.53 redaction guard), so `bun test` was red on a clean checkout of main. Capture a fresh baseline at HEAD (bun run scripts/capture-baseline.ts --tag v1.53.0.0) and re-point the test at it. The per-skill 1.05 ratio is kept, so future bloat is still caught; only the anchor moved. Mirrors the earlier skill-size-budget rebase (v1.44.1 -> v1.47.0.0). Historical v1.44.1 / v1.46.0.0 / v1.47.0.0 baselines are retained for the v1->v2 audit trail. The captured skill bytes equal origin/main exactly (this branch left every SKILL.md untouched). Clears the pre-existing failures noted in the v1.53.0.0 CHANGELOG. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(plan-tune): de-flake "derive pushes scope_appetite up" The test was ~25-50% flaky (worse on main). gstack-question-log fires a fire-and-forget background `--derive` after every write; the 5 rapid log writes spawned 5 racing background derives that collided with the test's explicit --derive — a late one that only saw 3 entries could clobber developer-profile.json after the explicit one wrote sample_size=5. Set GSTACK_QUESTION_LOG_NO_DERIVE=1 (the flag the binary documents for exactly this case) so the writes don't spawn background derives. The explicit --derive still runs, so real derive behavior is still asserted. 20/20 green after. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * chore: bump version and changelog (v1.53.1.0) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * docs: document non-interactive dev-setup + plan-tune hook flags (v1.53.1.0) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This commit is contained in:
+633
@@ -0,0 +1,633 @@
|
||||
{
|
||||
"tag": "v1.53.0.0",
|
||||
"capturedAt": "2026-05-30T18:00:56.209Z",
|
||||
"capturedFromCommit": "352f6a57",
|
||||
"capturedFromBranch": "garrytan/setup-plan-tune-hooks-flags",
|
||||
"totalSkills": 52,
|
||||
"totalCorpusBytes": 3179282,
|
||||
"estTotalCatalogTokens": 4116,
|
||||
"topHeaviest": [
|
||||
{
|
||||
"skill": "ship",
|
||||
"skillMdBytes": 170491,
|
||||
"skillMdLines": 3153,
|
||||
"estTokens": 42623,
|
||||
"tmplBytes": 53240,
|
||||
"descriptionLen": 291,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
{
|
||||
"skill": "plan-ceo-review",
|
||||
"skillMdBytes": 137751,
|
||||
"skillMdLines": 2290,
|
||||
"estTokens": 34438,
|
||||
"tmplBytes": 63461,
|
||||
"descriptionLen": 794,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
{
|
||||
"skill": "office-hours",
|
||||
"skillMdBytes": 118280,
|
||||
"skillMdLines": 2161,
|
||||
"estTokens": 29570,
|
||||
"tmplBytes": 55534,
|
||||
"descriptionLen": 860,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
{
|
||||
"skill": "plan-design-review",
|
||||
"skillMdBytes": 112728,
|
||||
"skillMdLines": 2019,
|
||||
"estTokens": 28182,
|
||||
"tmplBytes": 28717,
|
||||
"descriptionLen": 218,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
{
|
||||
"skill": "plan-devex-review",
|
||||
"skillMdBytes": 111292,
|
||||
"skillMdLines": 2212,
|
||||
"estTokens": 27823,
|
||||
"tmplBytes": 35773,
|
||||
"descriptionLen": 250,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
{
|
||||
"skill": "spec",
|
||||
"skillMdBytes": 109688,
|
||||
"skillMdLines": 2239,
|
||||
"estTokens": 27422,
|
||||
"tmplBytes": 30590,
|
||||
"descriptionLen": 282,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
{
|
||||
"skill": "plan-eng-review",
|
||||
"skillMdBytes": 107655,
|
||||
"skillMdLines": 1849,
|
||||
"estTokens": 26914,
|
||||
"tmplBytes": 26302,
|
||||
"descriptionLen": 231,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
{
|
||||
"skill": "design-review",
|
||||
"skillMdBytes": 96618,
|
||||
"skillMdLines": 1936,
|
||||
"estTokens": 24155,
|
||||
"tmplBytes": 11674,
|
||||
"descriptionLen": 304,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
{
|
||||
"skill": "review",
|
||||
"skillMdBytes": 95012,
|
||||
"skillMdLines": 1766,
|
||||
"estTokens": 23753,
|
||||
"tmplBytes": 14099,
|
||||
"descriptionLen": 205,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
{
|
||||
"skill": "land-and-deploy",
|
||||
"skillMdBytes": 92850,
|
||||
"skillMdLines": 1860,
|
||||
"estTokens": 23213,
|
||||
"tmplBytes": 48624,
|
||||
"descriptionLen": 160,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
}
|
||||
],
|
||||
"skills": {
|
||||
"autoplan": {
|
||||
"skill": "autoplan",
|
||||
"skillMdBytes": 91834,
|
||||
"skillMdLines": 1788,
|
||||
"estTokens": 22959,
|
||||
"tmplBytes": 45271,
|
||||
"descriptionLen": 366,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
"benchmark": {
|
||||
"skill": "benchmark",
|
||||
"skillMdBytes": 33266,
|
||||
"skillMdLines": 747,
|
||||
"estTokens": 8317,
|
||||
"tmplBytes": 9378,
|
||||
"descriptionLen": 213,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"benchmark-models": {
|
||||
"skill": "benchmark-models",
|
||||
"skillMdBytes": 29333,
|
||||
"skillMdLines": 622,
|
||||
"estTokens": 7333,
|
||||
"tmplBytes": 6631,
|
||||
"descriptionLen": 217,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"browse": {
|
||||
"skill": "browse",
|
||||
"skillMdBytes": 48151,
|
||||
"skillMdLines": 930,
|
||||
"estTokens": 12038,
|
||||
"tmplBytes": 10805,
|
||||
"descriptionLen": 181,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"canary": {
|
||||
"skill": "canary",
|
||||
"skillMdBytes": 48069,
|
||||
"skillMdLines": 994,
|
||||
"estTokens": 12017,
|
||||
"tmplBytes": 8033,
|
||||
"descriptionLen": 180,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"careful": {
|
||||
"skill": "careful",
|
||||
"skillMdBytes": 2551,
|
||||
"skillMdLines": 68,
|
||||
"estTokens": 638,
|
||||
"tmplBytes": 2435,
|
||||
"descriptionLen": 315,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"codex": {
|
||||
"skill": "codex",
|
||||
"skillMdBytes": 80584,
|
||||
"skillMdLines": 1523,
|
||||
"estTokens": 20146,
|
||||
"tmplBytes": 34143,
|
||||
"descriptionLen": 187,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"context-restore": {
|
||||
"skill": "context-restore",
|
||||
"skillMdBytes": 42457,
|
||||
"skillMdLines": 852,
|
||||
"estTokens": 10614,
|
||||
"tmplBytes": 5255,
|
||||
"descriptionLen": 238,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"context-save": {
|
||||
"skill": "context-save",
|
||||
"skillMdBytes": 46654,
|
||||
"skillMdLines": 970,
|
||||
"estTokens": 11664,
|
||||
"tmplBytes": 9293,
|
||||
"descriptionLen": 168,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"cso": {
|
||||
"skill": "cso",
|
||||
"skillMdBytes": 78849,
|
||||
"skillMdLines": 1462,
|
||||
"estTokens": 19712,
|
||||
"tmplBytes": 35646,
|
||||
"descriptionLen": 196,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"design-consultation": {
|
||||
"skill": "design-consultation",
|
||||
"skillMdBytes": 80186,
|
||||
"skillMdLines": 1565,
|
||||
"estTokens": 20047,
|
||||
"tmplBytes": 25899,
|
||||
"descriptionLen": 888,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"design-html": {
|
||||
"skill": "design-html",
|
||||
"skillMdBytes": 67511,
|
||||
"skillMdLines": 1453,
|
||||
"estTokens": 16878,
|
||||
"tmplBytes": 22567,
|
||||
"descriptionLen": 233,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"design-review": {
|
||||
"skill": "design-review",
|
||||
"skillMdBytes": 96618,
|
||||
"skillMdLines": 1936,
|
||||
"estTokens": 24155,
|
||||
"tmplBytes": 11674,
|
||||
"descriptionLen": 304,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"design-shotgun": {
|
||||
"skill": "design-shotgun",
|
||||
"skillMdBytes": 63800,
|
||||
"skillMdLines": 1315,
|
||||
"estTokens": 15950,
|
||||
"tmplBytes": 13331,
|
||||
"descriptionLen": 786,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"devex-review": {
|
||||
"skill": "devex-review",
|
||||
"skillMdBytes": 65377,
|
||||
"skillMdLines": 1237,
|
||||
"estTokens": 16344,
|
||||
"tmplBytes": 7984,
|
||||
"descriptionLen": 201,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"document-generate": {
|
||||
"skill": "document-generate",
|
||||
"skillMdBytes": 54797,
|
||||
"skillMdLines": 1194,
|
||||
"estTokens": 13699,
|
||||
"tmplBytes": 15939,
|
||||
"descriptionLen": 334,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"document-release": {
|
||||
"skill": "document-release",
|
||||
"skillMdBytes": 59827,
|
||||
"skillMdLines": 1248,
|
||||
"estTokens": 14957,
|
||||
"tmplBytes": 20974,
|
||||
"descriptionLen": 192,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"freeze": {
|
||||
"skill": "freeze",
|
||||
"skillMdBytes": 3154,
|
||||
"skillMdLines": 92,
|
||||
"estTokens": 789,
|
||||
"tmplBytes": 3038,
|
||||
"descriptionLen": 503,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"gstack-upgrade": {
|
||||
"skill": "gstack-upgrade",
|
||||
"skillMdBytes": 10817,
|
||||
"skillMdLines": 285,
|
||||
"estTokens": 2704,
|
||||
"tmplBytes": 10667,
|
||||
"descriptionLen": 163,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"guard": {
|
||||
"skill": "guard",
|
||||
"skillMdBytes": 3297,
|
||||
"skillMdLines": 91,
|
||||
"estTokens": 824,
|
||||
"tmplBytes": 3181,
|
||||
"descriptionLen": 686,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"health": {
|
||||
"skill": "health",
|
||||
"skillMdBytes": 48880,
|
||||
"skillMdLines": 1018,
|
||||
"estTokens": 12220,
|
||||
"tmplBytes": 11617,
|
||||
"descriptionLen": 184,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"investigate": {
|
||||
"skill": "investigate",
|
||||
"skillMdBytes": 51373,
|
||||
"skillMdLines": 1016,
|
||||
"estTokens": 12843,
|
||||
"tmplBytes": 11561,
|
||||
"descriptionLen": 1379,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"ios-clean": {
|
||||
"skill": "ios-clean",
|
||||
"skillMdBytes": 42009,
|
||||
"skillMdLines": 817,
|
||||
"estTokens": 10502,
|
||||
"tmplBytes": 3851,
|
||||
"descriptionLen": 252,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"ios-design-review": {
|
||||
"skill": "ios-design-review",
|
||||
"skillMdBytes": 42595,
|
||||
"skillMdLines": 819,
|
||||
"estTokens": 10649,
|
||||
"tmplBytes": 4417,
|
||||
"descriptionLen": 209,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"ios-fix": {
|
||||
"skill": "ios-fix",
|
||||
"skillMdBytes": 41724,
|
||||
"skillMdLines": 815,
|
||||
"estTokens": 10431,
|
||||
"tmplBytes": 3574,
|
||||
"descriptionLen": 187,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"ios-qa": {
|
||||
"skill": "ios-qa",
|
||||
"skillMdBytes": 48235,
|
||||
"skillMdLines": 935,
|
||||
"estTokens": 12059,
|
||||
"tmplBytes": 10090,
|
||||
"descriptionLen": 223,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"ios-sync": {
|
||||
"skill": "ios-sync",
|
||||
"skillMdBytes": 41701,
|
||||
"skillMdLines": 808,
|
||||
"estTokens": 10425,
|
||||
"tmplBytes": 3544,
|
||||
"descriptionLen": 269,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"land-and-deploy": {
|
||||
"skill": "land-and-deploy",
|
||||
"skillMdBytes": 92850,
|
||||
"skillMdLines": 1860,
|
||||
"estTokens": 23213,
|
||||
"tmplBytes": 48624,
|
||||
"descriptionLen": 160,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"landing-report": {
|
||||
"skill": "landing-report",
|
||||
"skillMdBytes": 44949,
|
||||
"skillMdLines": 878,
|
||||
"estTokens": 11237,
|
||||
"tmplBytes": 6806,
|
||||
"descriptionLen": 195,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"learn": {
|
||||
"skill": "learn",
|
||||
"skillMdBytes": 42686,
|
||||
"skillMdLines": 895,
|
||||
"estTokens": 10672,
|
||||
"tmplBytes": 5594,
|
||||
"descriptionLen": 178,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"make-pdf": {
|
||||
"skill": "make-pdf",
|
||||
"skillMdBytes": 29890,
|
||||
"skillMdLines": 670,
|
||||
"estTokens": 7473,
|
||||
"tmplBytes": 5546,
|
||||
"descriptionLen": 177,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"office-hours": {
|
||||
"skill": "office-hours",
|
||||
"skillMdBytes": 118280,
|
||||
"skillMdLines": 2161,
|
||||
"estTokens": 29570,
|
||||
"tmplBytes": 55534,
|
||||
"descriptionLen": 860,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"open-gstack-browser": {
|
||||
"skill": "open-gstack-browser",
|
||||
"skillMdBytes": 47095,
|
||||
"skillMdLines": 958,
|
||||
"estTokens": 11774,
|
||||
"tmplBytes": 7702,
|
||||
"descriptionLen": 204,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"pair-agent": {
|
||||
"skill": "pair-agent",
|
||||
"skillMdBytes": 47903,
|
||||
"skillMdLines": 1014,
|
||||
"estTokens": 11976,
|
||||
"tmplBytes": 8548,
|
||||
"descriptionLen": 167,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"plan-ceo-review": {
|
||||
"skill": "plan-ceo-review",
|
||||
"skillMdBytes": 137751,
|
||||
"skillMdLines": 2290,
|
||||
"estTokens": 34438,
|
||||
"tmplBytes": 63461,
|
||||
"descriptionLen": 794,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
"plan-design-review": {
|
||||
"skill": "plan-design-review",
|
||||
"skillMdBytes": 112728,
|
||||
"skillMdLines": 2019,
|
||||
"estTokens": 28182,
|
||||
"tmplBytes": 28717,
|
||||
"descriptionLen": 218,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
"plan-devex-review": {
|
||||
"skill": "plan-devex-review",
|
||||
"skillMdBytes": 111292,
|
||||
"skillMdLines": 2212,
|
||||
"estTokens": 27823,
|
||||
"tmplBytes": 35773,
|
||||
"descriptionLen": 250,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
"plan-eng-review": {
|
||||
"skill": "plan-eng-review",
|
||||
"skillMdBytes": 107655,
|
||||
"skillMdLines": 1849,
|
||||
"estTokens": 26914,
|
||||
"tmplBytes": 26302,
|
||||
"descriptionLen": 231,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
"plan-tune": {
|
||||
"skill": "plan-tune",
|
||||
"skillMdBytes": 64017,
|
||||
"skillMdLines": 1355,
|
||||
"estTokens": 16004,
|
||||
"tmplBytes": 26922,
|
||||
"descriptionLen": 325,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"qa": {
|
||||
"skill": "qa",
|
||||
"skillMdBytes": 74827,
|
||||
"skillMdLines": 1626,
|
||||
"estTokens": 18707,
|
||||
"tmplBytes": 12701,
|
||||
"descriptionLen": 218,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"qa-only": {
|
||||
"skill": "qa-only",
|
||||
"skillMdBytes": 57385,
|
||||
"skillMdLines": 1198,
|
||||
"estTokens": 14346,
|
||||
"tmplBytes": 3851,
|
||||
"descriptionLen": 165,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"retro": {
|
||||
"skill": "retro",
|
||||
"skillMdBytes": 83853,
|
||||
"skillMdLines": 1754,
|
||||
"estTokens": 20963,
|
||||
"tmplBytes": 42427,
|
||||
"descriptionLen": 648,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"review": {
|
||||
"skill": "review",
|
||||
"skillMdBytes": 95012,
|
||||
"skillMdLines": 1766,
|
||||
"estTokens": 23753,
|
||||
"tmplBytes": 14099,
|
||||
"descriptionLen": 205,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"scrape": {
|
||||
"skill": "scrape",
|
||||
"skillMdBytes": 44605,
|
||||
"skillMdLines": 891,
|
||||
"estTokens": 11151,
|
||||
"tmplBytes": 5220,
|
||||
"descriptionLen": 167,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"setup-browser-cookies": {
|
||||
"skill": "setup-browser-cookies",
|
||||
"skillMdBytes": 26618,
|
||||
"skillMdLines": 594,
|
||||
"estTokens": 6655,
|
||||
"tmplBytes": 2724,
|
||||
"descriptionLen": 222,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"setup-deploy": {
|
||||
"skill": "setup-deploy",
|
||||
"skillMdBytes": 44891,
|
||||
"skillMdLines": 923,
|
||||
"estTokens": 11223,
|
||||
"tmplBytes": 7780,
|
||||
"descriptionLen": 197,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"setup-gbrain": {
|
||||
"skill": "setup-gbrain",
|
||||
"skillMdBytes": 81964,
|
||||
"skillMdLines": 1777,
|
||||
"estTokens": 20491,
|
||||
"tmplBytes": 44851,
|
||||
"descriptionLen": 323,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"ship": {
|
||||
"skill": "ship",
|
||||
"skillMdBytes": 170491,
|
||||
"skillMdLines": 3153,
|
||||
"estTokens": 42623,
|
||||
"tmplBytes": 53240,
|
||||
"descriptionLen": 291,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": true
|
||||
},
|
||||
"skillify": {
|
||||
"skill": "skillify",
|
||||
"skillMdBytes": 54498,
|
||||
"skillMdLines": 1172,
|
||||
"estTokens": 13625,
|
||||
"tmplBytes": 15107,
|
||||
"descriptionLen": 233,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"spec": {
|
||||
"skill": "spec",
|
||||
"skillMdBytes": 109688,
|
||||
"skillMdLines": 2239,
|
||||
"estTokens": 27422,
|
||||
"tmplBytes": 30590,
|
||||
"descriptionLen": 282,
|
||||
"hasGateEval": true,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"sync-gbrain": {
|
||||
"skill": "sync-gbrain",
|
||||
"skillMdBytes": 53201,
|
||||
"skillMdLines": 1070,
|
||||
"estTokens": 13300,
|
||||
"tmplBytes": 16077,
|
||||
"descriptionLen": 299,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
},
|
||||
"unfreeze": {
|
||||
"skill": "unfreeze",
|
||||
"skillMdBytes": 1504,
|
||||
"skillMdLines": 49,
|
||||
"estTokens": 376,
|
||||
"tmplBytes": 1386,
|
||||
"descriptionLen": 199,
|
||||
"hasGateEval": false,
|
||||
"hasPeriodicEval": false
|
||||
}
|
||||
}
|
||||
}
|
||||
@@ -2,9 +2,16 @@
|
||||
* Cathedral parity suite — gate-tier (free, structural + content checks).
|
||||
*
|
||||
* Runs every PARITY_INVARIANTS check against the current SKILL.md output
|
||||
* vs the v1.44.1 baseline. Failures get an actionable, per-skill report
|
||||
* vs the v1.53.0.0 baseline. Failures get an actionable, per-skill report
|
||||
* showing missing phrases, missing headings, and size ratios.
|
||||
*
|
||||
* Baseline rebased v1.44.1 → v1.53.0.0: the brain-aware-planning releases
|
||||
* (v1.49–v1.52) plus the v1.53 redaction guard pushed five planning skills
|
||||
* past the 5% ratchet on the frozen v1.44.1 anchor. Rebasing absorbs that
|
||||
* legitimate growth at HEAD while keeping the per-skill 1.05 ratio so future
|
||||
* bloat is still caught. Historical v1.44.1 / v1.46.0.0 / v1.47.0.0 baselines
|
||||
* are retained in test/fixtures/ for the v1→v2 audit trail.
|
||||
*
|
||||
* Periodic-tier LLM-judge parity (paid) lands in Phase B (v2.0.0.0)
|
||||
* alongside the sections/ extraction. Plumbing is in parity-harness.ts.
|
||||
*/
|
||||
@@ -16,9 +23,9 @@ import { runParityChecks, PARITY_INVARIANTS } from './helpers/parity-harness';
|
||||
import type { ParityBaseline } from './helpers/capture-parity-baseline';
|
||||
|
||||
const REPO_ROOT = path.resolve(import.meta.dir, '..');
|
||||
const BASELINE_PATH = path.join(REPO_ROOT, 'test', 'fixtures', 'parity-baseline-v1.44.1.json');
|
||||
const BASELINE_PATH = path.join(REPO_ROOT, 'test', 'fixtures', 'parity-baseline-v1.53.0.0.json');
|
||||
|
||||
describe('parity suite vs v1.44.1 baseline (gate, free)', () => {
|
||||
describe('parity suite vs v1.53.0.0 baseline (gate, free)', () => {
|
||||
test('baseline exists', () => {
|
||||
expect(fs.existsSync(BASELINE_PATH)).toBe(true);
|
||||
});
|
||||
@@ -43,7 +50,7 @@ describe('parity suite vs v1.44.1 baseline (gate, free)', () => {
|
||||
.map(d => ` ${d.skill}:\n - ${d.failures.join('\n - ')}`)
|
||||
.join('\n');
|
||||
throw new Error(
|
||||
`${report.failed} skill(s) failed parity checks vs v1.44.1:\n${failureMessages}`,
|
||||
`${report.failed} skill(s) failed parity checks vs ${baseline.tag}:\n${failureMessages}`,
|
||||
);
|
||||
});
|
||||
});
|
||||
|
||||
@@ -535,7 +535,15 @@ describe('end-to-end pipeline (binaries working together)', () => {
|
||||
test('log many expand choices → derive pushes scope_appetite up', () => {
|
||||
const tmpHome = fs.mkdtempSync(path.join(require('os').tmpdir(), 'gstack-e2e-'));
|
||||
try {
|
||||
const env = { ...process.env, GSTACK_HOME: tmpHome };
|
||||
// GSTACK_QUESTION_LOG_NO_DERIVE=1 suppresses gstack-question-log's
|
||||
// fire-and-forget background `--derive` (it nohups one per write). Without
|
||||
// it, the 5 rapid log writes spawn 5 racing background derives that collide
|
||||
// with this test's explicit --derive below — a late background derive that
|
||||
// only saw 3 entries can clobber developer-profile.json after the explicit
|
||||
// one wrote sample_size=5, making the test flaky (~25-50% fail). The binary
|
||||
// documents this flag for exactly this case. The explicit --derive still
|
||||
// runs (it ignores the flag), so real derive behavior is still asserted.
|
||||
const env = { ...process.env, GSTACK_HOME: tmpHome, GSTACK_QUESTION_LOG_NO_DERIVE: '1' };
|
||||
const { spawnSync } = require('child_process');
|
||||
const logBin = path.join(ROOT, 'bin', 'gstack-question-log');
|
||||
const devBin = path.join(ROOT, 'bin', 'gstack-developer-profile');
|
||||
|
||||
@@ -0,0 +1,123 @@
|
||||
import { describe, test, expect, beforeAll, afterAll } from 'bun:test';
|
||||
import * as fs from 'fs';
|
||||
import * as os from 'os';
|
||||
import * as path from 'path';
|
||||
import { execSync } from 'child_process';
|
||||
|
||||
// Regression guard for the conductor/workspace setup hang:
|
||||
// `./setup` used a blocking `read -r` to ask "Install both hooks now? [y/N]".
|
||||
// When setup runs under a forwarded/automated TTY (conductor workspace setup,
|
||||
// CI with a pty) the read blocked forever. The fix moves the decision into
|
||||
// flags + env + saved config with a non-blocking, time-bounded prompt fallback.
|
||||
//
|
||||
// These are static + binary-level assertions (free, <1s) — they lock in the
|
||||
// contract without running the full (environment-mutating) setup script.
|
||||
|
||||
const ROOT = path.resolve(import.meta.dir, '..');
|
||||
const SETUP = path.join(ROOT, 'setup');
|
||||
const GSTACK_CONFIG = path.join(ROOT, 'bin', 'gstack-config');
|
||||
|
||||
const setupSrc = fs.readFileSync(SETUP, 'utf-8');
|
||||
|
||||
describe('setup: plan-tune hooks are non-interactive-safe', () => {
|
||||
test('exposes --plan-tune-hooks / --no-plan-tune-hooks / =value flags', () => {
|
||||
expect(setupSrc).toContain('--plan-tune-hooks)');
|
||||
expect(setupSrc).toContain('--no-plan-tune-hooks)');
|
||||
expect(setupSrc).toContain('--plan-tune-hooks=*)');
|
||||
});
|
||||
|
||||
test('resolution falls through env then saved config', () => {
|
||||
expect(setupSrc).toContain('GSTACK_PLAN_TUNE_HOOKS');
|
||||
expect(setupSrc).toContain('get plan_tune_hooks');
|
||||
});
|
||||
|
||||
test('explicit yes/no decisions never reach a prompt', () => {
|
||||
// The yes/no branches must short-circuit before the interactive branch.
|
||||
const yesIdx = setupSrc.indexOf('PT_DECISION" = "yes"');
|
||||
const noIdx = setupSrc.indexOf('PT_DECISION" = "no"');
|
||||
const promptIdx = setupSrc.indexOf('Install both hooks now?');
|
||||
expect(yesIdx).toBeGreaterThan(-1);
|
||||
expect(noIdx).toBeGreaterThan(-1);
|
||||
expect(yesIdx).toBeLessThan(promptIdx);
|
||||
expect(noIdx).toBeLessThan(promptIdx);
|
||||
});
|
||||
|
||||
test('the interactive prompt is time-bounded (cannot hang)', () => {
|
||||
// No bare blocking read for the plan-tune reply.
|
||||
expect(setupSrc).not.toMatch(/read -r PLAN_TUNE_INSTALL_REPLY\b/);
|
||||
// It must use a timed read from the controlling tty with an empty fallback.
|
||||
// The timeout may be a literal or a named variable (e.g. "$_PT_PROMPT_TIMEOUT").
|
||||
expect(setupSrc).toMatch(/read -t (?:\d+|"?\$\{?\w+\}?"?) -r PLAN_TUNE_INSTALL_REPLY <\/dev\/tty/);
|
||||
});
|
||||
|
||||
test('interactive prompt is gated on a real TTY and non-quiet', () => {
|
||||
// The prompt branch requires both stdin+stdout TTYs and not --quiet.
|
||||
expect(setupSrc).toMatch(/\[ "\$QUIET" -ne 1 \] && \[ -t 0 \] && \[ -t 1 \]/);
|
||||
});
|
||||
|
||||
test('decision input is normalized (lowercase + whitespace-stripped)', () => {
|
||||
// "YES" / " yes" from a flag/env must not silently downgrade to skip.
|
||||
expect(setupSrc).toMatch(/tr '\[:upper:\]' '\[:lower:\]'/);
|
||||
expect(setupSrc).toMatch(/PT_DECISION=\$\(printf .* tr/);
|
||||
});
|
||||
});
|
||||
|
||||
describe('dev-setup: never silently mutates global settings.json', () => {
|
||||
const DEV_SETUP = path.join(ROOT, 'bin', 'dev-setup');
|
||||
const devSetupSrc = fs.readFileSync(DEV_SETUP, 'utf-8');
|
||||
|
||||
test('runs setup with stdin detached AND --plan-tune-hooks=prompt pin', () => {
|
||||
// stdin alone only suppresses the prompt branch; the flag (highest
|
||||
// precedence) is what stops a saved `plan_tune_hooks: yes` / env opt-in
|
||||
// from rewriting global hooks to the ephemeral worktree path.
|
||||
expect(devSetupSrc).toMatch(/setup" --plan-tune-hooks=prompt <\/dev\/null/);
|
||||
});
|
||||
});
|
||||
|
||||
describe('gstack-config: plan_tune_hooks key', () => {
|
||||
// Isolate state: gstack-config reads $GSTACK_HOME/config.yaml. Point it at a
|
||||
// fresh temp dir so `get` returns the built-in default rather than whatever
|
||||
// the host machine has in ~/.gstack/config.yaml (which would make the
|
||||
// default-value assertion non-deterministic).
|
||||
let tmpHome: string;
|
||||
let env: NodeJS.ProcessEnv;
|
||||
|
||||
beforeAll(() => {
|
||||
tmpHome = fs.mkdtempSync(path.join(os.tmpdir(), 'gstack-cfg-test-'));
|
||||
env = { ...process.env, GSTACK_HOME: tmpHome };
|
||||
});
|
||||
|
||||
afterAll(() => {
|
||||
fs.rmSync(tmpHome, { recursive: true, force: true });
|
||||
});
|
||||
|
||||
test('default is "prompt"', () => {
|
||||
const out = execSync(`${GSTACK_CONFIG} get plan_tune_hooks`, {
|
||||
encoding: 'utf-8',
|
||||
env,
|
||||
}).trim();
|
||||
expect(out).toBe('prompt');
|
||||
});
|
||||
|
||||
test('appears in defaults and list output', () => {
|
||||
const defaults = execSync(`${GSTACK_CONFIG} defaults`, { encoding: 'utf-8', env });
|
||||
expect(defaults).toContain('plan_tune_hooks');
|
||||
const list = execSync(`${GSTACK_CONFIG} list`, { encoding: 'utf-8', env });
|
||||
expect(list).toContain('plan_tune_hooks');
|
||||
});
|
||||
|
||||
test('accepts valid values (round-trips yes/no/prompt)', () => {
|
||||
for (const v of ['yes', 'no', 'prompt']) {
|
||||
execSync(`${GSTACK_CONFIG} set plan_tune_hooks ${v}`, { encoding: 'utf-8', env });
|
||||
const got = execSync(`${GSTACK_CONFIG} get plan_tune_hooks`, { encoding: 'utf-8', env }).trim();
|
||||
expect(got).toBe(v);
|
||||
}
|
||||
});
|
||||
|
||||
test('rejects out-of-domain values (warns + falls back to prompt)', () => {
|
||||
const res = execSync(`${GSTACK_CONFIG} set plan_tune_hooks maybe 2>&1`, { encoding: 'utf-8', env });
|
||||
expect(res.toLowerCase()).toContain('not recognized');
|
||||
const got = execSync(`${GSTACK_CONFIG} get plan_tune_hooks`, { encoding: 'utf-8', env }).trim();
|
||||
expect(got).toBe('prompt');
|
||||
});
|
||||
});
|
||||
Reference in New Issue
Block a user