Garry Tan
86bc2e993c
test(office-hours): retier builder-wildness from gate to periodic
...
The office-hours-builder-wildness E2E is an LLM-judge creativity score
(axis_a ≥4 on /office-hours BUILDER output, axis_b ≥4 on same).
Per CLAUDE.md tier-classification rules — "Quality benchmark, Opus model
test, or non-deterministic? -> periodic" — this test belongs in periodic,
not gate.
The wave's +21-line CJK preamble cascade (#1205 ) dropped the same prompt
from a 5/5 score on main to 3/3 on the wave with identical model + fixture
+ retry budget. Same generator, same judge, different preamble byte count
in the run-time context. That's noise the gate tier shouldn't surface as
a blocking failure.
Functional gates (office-hours-spec-review, office-hours-forcing-energy)
remain on gate — they test structure, not creativity.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com >
2026-05-11 09:47:12 -07:00
..
2026-05-10 11:18:57 -07:00
2026-05-11 09:47:12 -07:00
2026-04-24 00:04:53 -07:00
2026-03-18 23:57:59 -05:00
2026-04-19 17:50:31 +08:00
2026-04-19 17:50:31 +08:00
2026-04-19 17:50:31 +08:00
2026-05-06 19:37:53 -07:00
2026-04-08 22:21:28 -10:00
2026-04-23 07:25:20 -07:00
2026-03-23 23:05:22 -07:00
2026-04-18 12:30:54 +08:00
2026-05-09 08:06:47 -07:00
2026-04-19 08:38:19 +08:00
2026-03-30 22:07:50 -06:00
2026-04-26 13:55:13 -07:00
2026-04-18 15:05:42 +08:00
2026-04-24 01:38:21 -07:00
2026-04-24 01:38:21 -07:00
2026-04-24 01:38:21 -07:00
2026-05-04 09:29:48 -07:00
2026-04-24 01:38:21 -07:00
2026-04-16 10:41:38 -07:00
2026-05-10 11:18:57 -07:00
2026-04-05 02:02:06 -07:00
2026-05-06 19:37:53 -07:00
2026-05-06 19:37:53 -07:00
2026-05-02 08:40:30 -07:00
2026-04-18 15:05:42 +08:00
2026-05-06 19:37:53 -07:00
2026-05-06 19:37:53 -07:00
2026-04-28 01:17:54 -07:00
2026-05-08 12:46:15 -07:00
2026-05-02 08:40:30 -07:00
2026-05-10 11:06:00 -07:00
2026-04-23 23:03:27 -07:00
2026-05-01 07:21:28 -07:00
2026-04-18 15:05:42 +08:00
2026-04-18 15:05:42 +08:00
2026-04-28 01:17:54 -07:00
2026-04-26 13:55:13 -07:00
2026-05-10 06:57:24 -07:00
2026-04-17 00:45:13 -07:00
2026-04-18 15:05:42 +08:00
2026-04-06 00:47:04 -07:00
2026-03-29 17:02:01 -06:00
2026-05-01 19:51:51 -07:00
2026-05-07 20:14:59 -07:00
2026-04-19 08:38:19 +08:00
2026-05-06 19:37:53 -07:00
2026-04-25 11:52:48 -07:00
2026-05-06 19:37:53 -07:00
2026-04-17 00:45:13 -07:00
2026-04-18 15:05:42 +08:00
2026-05-06 19:37:53 -07:00
2026-05-01 07:06:37 -07:00
2026-04-23 18:25:34 -07:00
2026-04-18 15:05:42 +08:00
2026-04-16 15:39:44 -07:00
2026-04-23 18:25:34 -07:00
2026-03-26 23:21:27 -06:00
2026-04-24 01:38:21 -07:00
2026-04-18 12:30:54 +08:00
2026-05-06 19:37:53 -07:00
2026-05-09 08:06:47 -07:00
2026-04-18 23:58:59 +08:00
2026-04-26 13:55:13 -07:00
2026-04-19 08:38:19 +08:00
2026-05-01 19:51:51 -07:00
2026-04-26 13:55:13 -07:00
2026-05-01 08:45:36 -07:00
2026-04-26 13:55:13 -07:00
2026-04-19 08:38:19 +08:00
2026-05-11 09:46:21 -07:00
2026-05-06 19:37:53 -07:00
2026-04-23 17:54:54 -07:00
2026-04-19 08:38:19 +08:00
2026-03-23 06:57:22 -07:00
2026-03-26 11:08:31 -07:00
2026-05-10 11:08:26 -07:00
2026-03-31 23:08:22 -06:00
2026-05-04 09:29:48 -07:00
2026-05-01 08:45:36 -07:00
2026-05-01 19:51:51 -07:00
2026-04-19 05:44:39 +08:00
2026-04-22 01:06:22 -07:00
2026-04-23 18:42:58 -07:00
2026-04-30 02:50:09 -07:00
2026-05-06 20:27:20 -07:00
2026-04-30 02:50:09 -07:00
2026-05-09 17:01:13 -07:00
2026-04-30 02:50:09 -07:00
2026-05-06 20:27:20 -07:00
2026-05-09 17:01:13 -07:00
2026-04-26 13:55:13 -07:00
2026-04-30 02:50:09 -07:00
2026-05-06 20:27:20 -07:00
2026-05-03 20:26:59 -07:00
2026-04-30 02:50:09 -07:00
2026-05-06 20:27:20 -07:00
2026-05-09 17:01:13 -07:00
2026-05-09 17:01:13 -07:00
2026-05-01 19:51:51 -07:00
2026-04-26 13:55:13 -07:00
2026-04-23 18:25:34 -07:00
2026-04-18 15:05:42 +08:00
2026-05-10 11:08:26 -07:00
2026-05-10 11:08:26 -07:00
2026-03-23 10:17:33 -07:00
2026-03-30 22:07:50 -06:00
2026-05-10 11:08:26 -07:00
2026-04-19 08:38:19 +08:00
2026-05-06 19:37:53 -07:00
2026-05-06 19:37:53 -07:00
2026-04-26 13:55:13 -07:00
2026-04-04 15:32:20 -07:00
2026-04-28 20:08:04 -07:00
2026-05-10 11:08:26 -07:00
2026-03-31 23:08:22 -06:00
2026-03-26 17:31:53 -06:00
2026-03-13 21:08:12 -07:00
2026-04-16 10:41:38 -07:00
2026-05-09 08:06:47 -07:00
2026-04-19 17:50:31 +08:00
2026-04-22 01:06:22 -07:00
2026-03-29 21:43:36 -06:00
2026-05-01 07:21:28 -07:00
2026-04-01 00:50:42 -06:00
2026-05-09 17:01:13 -07:00
2026-03-27 00:44:37 -06:00
2026-04-18 15:05:42 +08:00
2026-04-18 15:05:42 +08:00
2026-04-06 14:41:06 -07:00
2026-04-26 13:55:13 -07:00