perf: pin quality tests to Opus, add --retry 2 and test:e2e:fast tier

~10 quality-sensitive tests (planted-bug detection, design quality judge,
strategic review, retro analysis) are now explicitly pinned to Opus. The ~30
structure tests default to Sonnet for a roughly 5x speedup.
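
A minimal sketch of the pinning pattern described above, assuming a per-test
options object with an optional model field. The names RunOptions,
resolveModel, and DEFAULT_MODEL are hypothetical, and the Sonnet id is a
placeholder; only 'claude-opus-4-6' appears in this commit's diff.

```typescript
// Hypothetical sketch: these names are illustrative, not taken from the repo.
type RunOptions = {
  testName: string;
  timeout: number;
  model?: string; // optional per-test override
};

// Assumed placeholder id for the Sonnet default; the diff only shows the Opus id.
const DEFAULT_MODEL = 'claude-sonnet-4-6';

// Use the explicitly pinned model if present, otherwise fall back to the default.
function resolveModel(opts: RunOptions): string {
  return opts.model ?? DEFAULT_MODEL;
}

// A structure test leaves `model` unset and gets the faster default.
const structureTest: RunOptions = { testName: 'structure-check', timeout: 300_000 };

// A quality-sensitive test pins Opus explicitly.
const qualityTest: RunOptions = {
  testName: 'qa-quick',
  timeout: 300_000,
  model: 'claude-opus-4-6',
};
```

Keeping the override optional means only the quality-sensitive tests need to
change; every other test picks up the default without edits.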

Added --retry 2 to all E2E scripts so flaky tests are retried before failing.
Added a test:e2e:fast script that excludes the 8 slowest tests for quick feedback.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Garry Tan
2026-03-21 13:12:28 -07:00
parent ce4a5768fe
commit fa61e2fe3c
4 changed files with 9 additions and 2 deletions
+1
@@ -100,6 +100,7 @@ CRITICAL RULES:
   timeout: 300_000,
   testName: `qa-${label}`,
   runId,
+  model: 'claude-opus-4-6',
 });
 logCost(`/qa ${label}`, result);