mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-05 05:05:08 +02:00
64bbbb2198
The test was flaky at 20 turns because the agent reads a 300-line SKILL.md, navigates, extracts design data, and writes a report. Added hints to skip preamble/batch commands/write early while still testing the real SKILL.md. Now completes in ~13 turns consistently. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>