gstack/test at a2ee09519c26d53afb29349e09ed72340957f959 - gstack - MS-GitHub-Backup (Gitea)

CalvinBackup/gstack

mirror of https://github.com/garrytan/gstack.git synced 2026-06-22 09:39:59 +02:00

Garry Tan a2ee09519c fix: journey routing tests — CLAUDE.md routing rules + stronger descriptions

Three journey E2E tests (ideation, ship, debug) were failing because
Claude answered directly instead of invoking the Skill tool. Root cause:
skill descriptions in system-reminder are too weak to override Claude's
default behavior for tasks it can handle natively.

Fix has two parts:
1. CLAUDE.md routing rules in test workdir — Claude weighs project-level
   instructions higher than skill description metadata
2. "Proactively invoke" (not "suggest") in office-hours, investigate,
   ship descriptions — reinforces the routing signal

10/10 journey tests now pass (was 7/10).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-29 20:44:11 -07:00
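
Part 1 of the fix above (routing rules placed in a CLAUDE.md inside the test workdir) could be sketched roughly as follows. This is a minimal illustration, not the repository's actual test helper: `RoutingRule`, `renderRoutingRules`, `writeRoutingRules`, the rule wording, and the pairing of each journey with a skill are all assumptions made for the example. Only the skill names `office-hours`, `investigate`, and `ship` come from the commit message.

```typescript
import { mkdtempSync, readFileSync, writeFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

// Hypothetical mapping from a user intent to the skill that should handle it.
interface RoutingRule {
  intent: string; // e.g. "brainstorming a product idea"
  skill: string; // e.g. "office-hours"
}

// Render the rules as a CLAUDE.md section. The wording is illustrative only.
function renderRoutingRules(rules: RoutingRule[]): string {
  const lines = rules.map(
    (r) =>
      `- When the user is ${r.intent}, invoke the \`${r.skill}\` skill ` +
      `with the Skill tool instead of answering directly.`,
  );
  return ["# Skill routing", "", ...lines, ""].join("\n");
}

// Write CLAUDE.md into the test working directory and return its path.
function writeRoutingRules(workdir: string, rules: RoutingRule[]): string {
  const file = join(workdir, "CLAUDE.md");
  writeFileSync(file, renderRoutingRules(rules), "utf8");
  return file;
}

// Assumed journey-to-skill pairing, for illustration only.
const workdir = mkdtempSync(join(tmpdir(), "journey-"));
const claudeMd = writeRoutingRules(workdir, [
  { intent: "brainstorming a product idea", skill: "office-hours" },
  { intent: "debugging a failure", skill: "investigate" },
  { intent: "ready to ship a change", skill: "ship" },
]);
console.log(readFileSync(claudeMd, "utf8"));
```

Writing the file into a fresh temporary directory keeps each journey test isolated, so routing rules from one run cannot leak into another.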

feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259)

2026-03-22 11:28:16 -07:00

feat: GStack Learns — per-project self-learning infrastructure (v0.13.4.0) (#622)

2026-03-29 17:02:01 -06:00

analytics.test.ts

feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)

2026-03-18 23:57:59 -05:00

audit-compliance.test.ts

fix: security audit compliance — credentials, telemetry, bun pin, untrusted warning (v0.12.12.0) (#574)

2026-03-27 12:06:58 -06:00

codex-e2e.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

gemini-e2e.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00

gen-skill-docs.test.ts

test: add resolver tests for INVOKE_SKILL, CHANGELOG_WORKFLOW, parameterized args

2026-03-29 16:12:10 -07:00

global-discover.test.ts

feat: /retro global — cross-project AI coding retrospective (v0.10.2.0) (#316)

2026-03-22 13:52:47 -07:00

hook-scripts.test.ts

feat: safety hook skills + skill usage telemetry (v0.7.1) (#189)

2026-03-18 23:57:59 -05:00

learnings.test.ts

feat: GStack Learns — per-project self-learning infrastructure (v0.13.4.0) (#622)

2026-03-29 17:02:01 -06:00

review-log.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552)

2026-03-26 23:21:27 -06:00

skill-e2e-bws.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552)

2026-03-26 23:21:27 -06:00

skill-e2e-cso.test.ts

feat: /cso v2 — infrastructure-first security audit (v0.11.6.0) (#384)

2026-03-23 06:57:22 -07:00

skill-e2e-deploy.test.ts

feat: /land-and-deploy first-run dry run + staging-first + trust ladder (v0.12.2.0) (#518)

2026-03-26 11:08:31 -07:00

skill-e2e-design.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-learnings.test.ts

feat: GStack Learns — per-project self-learning infrastructure (v0.13.4.0) (#622)

2026-03-29 17:02:01 -06:00

skill-e2e-plan.test.ts

test: E2E tests for plan review report and Codex offering (v0.11.15.0) (#449)

2026-03-24 07:30:24 -07:00

skill-e2e-qa-bugs.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-qa-workflow.test.ts

feat: CI evals on Ubicloud — 12 parallel runners + Docker image (v0.11.10.0) (#360)

2026-03-23 10:17:33 -07:00

skill-e2e-review.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552)

2026-03-26 23:21:27 -06:00

skill-e2e-sidebar.test.ts

fix: sidebar agent uses real tab URL instead of stale Playwright URL (v0.12.6.0) (#544)

2026-03-26 22:07:03 -06:00

skill-e2e-workflow.test.ts

feat: 2-tier E2E test system — granular touchfiles + gate/periodic split (v0.11.16.0) (#450)

2026-03-24 15:24:00 -07:00

skill-e2e.test.ts

feat: test coverage catalog — shared audit across plan/ship/review (v0.10.1.0) (#259)

2026-03-22 11:28:16 -07:00

skill-llm-eval.test.ts

feat: voice directive for all skills (v0.12.3.0) (#520)

2026-03-26 17:31:53 -06:00

skill-parser.test.ts

feat: SKILL.md template system, 3-tier testing, DX tools (v0.3.3) (#41)

2026-03-13 21:08:12 -07:00

skill-routing-e2e.test.ts

fix: journey routing tests — CLAUDE.md routing rules + stronger descriptions

2026-03-29 20:44:11 -07:00

skill-validation.test.ts

fix: journey routing tests — CLAUDE.md routing rules + stronger descriptions

2026-03-29 20:44:11 -07:00

telemetry.test.ts

fix: security audit remediation — 12 fixes, 20 tests (v0.13.1.0) (#595)

2026-03-28 08:35:24 -06:00

touchfiles.test.ts

feat: 2-tier E2E test system — granular touchfiles + gate/periodic split (v0.11.16.0) (#450)

2026-03-24 15:24:00 -07:00

uninstall.test.ts

feat: community PRs — faster install, skill namespacing, uninstall, Codex fallback, Windows fix, Python patterns (v0.12.9.0) (#561)

2026-03-27 00:44:37 -06:00

worktree.test.ts

feat: worktree isolation for E2E tests + infrastructure elegance (v0.11.12.0) (#425)

2026-03-23 23:05:22 -07:00