mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-06 21:46:40 +02:00
merge: incorporate origin/main into community-mode branch
Resolve conflicts from v0.11.13.0 merge (worktree isolation + resolver refactor). Keep 0.12.0.0 version, take main's modular gen-skill-docs resolvers, regenerate all SKILL.md files. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -356,6 +356,18 @@ Linux cookie import shipped in v0.11.11.0 (Wave 3). Supports Chrome, Chromium, B
|
||||
|
||||
|
||||
|
||||
### Extend worktree isolation to Claude E2E tests
|
||||
|
||||
**What:** Add `useWorktree?: boolean` option to `runSkillTest()` so any Claude E2E test can opt into worktree mode for full repo context instead of tmpdir fixtures.
|
||||
|
||||
**Why:** Some Claude E2E tests (CSO audit, review-sql-injection) create minimal fake repos but would produce more realistic results with full repo context. The infrastructure exists (`describeWithWorktree()` in e2e-helpers.ts) — this extends it to the session-runner level.
|
||||
|
||||
**Context:** WorktreeManager shipped in v0.11.12.0. Currently only Gemini/Codex tests use worktrees. Claude tests use planted-bug fixture repos which are correct for their purpose, but new tests that want real repo context can use `describeWithWorktree()` today. This TODO is about making it even easier via a flag on `runSkillTest()`.
|
||||
|
||||
**Effort:** M (human: ~2 days / CC: ~20 min)
|
||||
**Priority:** P3
|
||||
**Depends on:** Worktree isolation (shipped v0.11.12.0)
|
||||
|
||||
### E2E model pinning — SHIPPED
|
||||
|
||||
~~**What:** Pin E2E tests to claude-sonnet-4-6 for cost efficiency, add retry:2 for flaky LLM responses.~~
|
||||
|
||||
Reference in New Issue
Block a user