Files
gstack/test
Garry Tan 0adc71a13b fix: lower command reference completeness threshold to 3
The LLM judge consistently scores the command reference table's
completeness at 3/5 because it's a terse quick-reference format.
Detailed argument docs live in per-command sections, not the summary
table. The baseline already expects 3 — align the direct test threshold.
2026-03-24 14:27:11 -07:00
..
2026-03-24 14:19:25 -07:00