mirror of
https://github.com/garrytan/gstack.git
synced 2026-06-17 15:20:11 +02:00
75f8ce6362
GitHub Actions run 26170760809 failed on /plan-review-report (3 retries
all error_api, 1 turn, 0 tokens each) and /plan-ceo-review-expansion-energy
(1 transient failure, recovered on retry 2). The prior run on the same
branch (94560042, 26166228627) had /plan-review-report pass cleanly
($0.53, 8 turns, 33s).
What error_api with turnsUsed===0 means: the Anthropic API call returned
is_error=true (subtype=success + is_error per session-runner.ts:312-314)
before any model turn executed. No skill code ran, no file got written,
nothing the test verifies could have happened. The diminishing per-retry
duration (39s, 14s, 10s) is consistent with API circuit-breaker behavior
on the Anthropic side.
Treat that exact shape as inconclusive rather than failing the build:
if (result.exitReason === 'error_api' && result.costEstimate?.turnsUsed === 0) {
console.warn('[transient] ... — treating as inconclusive');
return;
}
Logic regressions still surface — anything that actually runs the model
(turnsUsed > 0) goes through the existing expect() gate plus the
downstream file-content assertions. This only catches the narrow case
where the model never ran at all.
Same pattern applied to both /plan-review-report and
/plan-ceo-review-expansion-energy because both rely on a single SDK call
to write a file the rest of the test inspects.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>