mirror of
https://github.com/garrytan/gstack.git
synced 2026-06-24 10:39:57 +02:00
test(parity): rebase parity baseline v1.53.0.0 -> v1.57.7.0
The v1.53 anchor is four minor versions stale. v1.54-v1.57 (ship/plan carving, carve-guards, AUQ prose fallback, the cross-session decision-log preamble) plus this branch's mandatory unresolved-decisions status line pushed the three plan-review skills past the 5% ratchet even after exhaustive compression. The new baseline captures current UNION sizes (skeleton + sections/*.md, matching what parity-harness measures) so the per-skill 1.05 ratio keeps catching future bloat. The frozen v1.44.1 integrity anchor and the v1.47 size-budget baseline are untouched. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -2,15 +2,19 @@
|
||||
* Cathedral parity suite — gate-tier (free, structural + content checks).
|
||||
*
|
||||
* Runs every PARITY_INVARIANTS check against the current SKILL.md output
|
||||
* vs the v1.53.0.0 baseline. Failures get an actionable, per-skill report
|
||||
* vs the v1.57.7.0 baseline. Failures get an actionable, per-skill report
|
||||
* showing missing phrases, missing headings, and size ratios.
|
||||
*
|
||||
* Baseline rebased v1.44.1 → v1.53.0.0: the brain-aware-planning releases
|
||||
* (v1.49–v1.52) plus the v1.53 redaction guard pushed five planning skills
|
||||
* past the 5% ratchet on the frozen v1.44.1 anchor. Rebasing absorbs that
|
||||
* legitimate growth at HEAD while keeping the per-skill 1.05 ratio so future
|
||||
* bloat is still caught. Historical v1.44.1 / v1.46.0.0 / v1.47.0.0 baselines
|
||||
* are retained in test/fixtures/ for the v1→v2 audit trail.
|
||||
* Baseline rebased v1.53.0.0 → v1.57.7.0: the v1.54–v1.57 releases (ship/plan
|
||||
* carving, carve-guards, AUQ prose fallback, the cross-session decision-log
|
||||
* preamble) plus the mandatory unresolved-decisions status added to every
|
||||
* GSTACK REVIEW REPORT pushed the three plan-review skills past the 5% ratchet
|
||||
* on the v1.53 anchor even after exhaustive compression. The v1.57.7.0 baseline
|
||||
* captures current UNION sizes (skeleton + sections/*.md, matching what the
|
||||
* harness measures) so the per-skill 1.05 ratio still catches future bloat.
|
||||
* Earlier rebase v1.44.1 → v1.53.0.0: brain-aware-planning (v1.49–v1.52) + the
|
||||
* v1.53 redaction guard. Historical v1.44.1 / v1.46.0.0 / v1.47.0.0 / v1.53.0.0
|
||||
* baselines are retained in test/fixtures/ for the audit trail.
|
||||
*
|
||||
* Periodic-tier LLM-judge parity (paid) lands in Phase B (v2.0.0.0)
|
||||
* alongside the sections/ extraction. Plumbing is in parity-harness.ts.
|
||||
@@ -23,9 +27,9 @@ import { runParityChecks, PARITY_INVARIANTS } from './helpers/parity-harness';
|
||||
import type { ParityBaseline } from './helpers/capture-parity-baseline';
|
||||
|
||||
const REPO_ROOT = path.resolve(import.meta.dir, '..');
|
||||
const BASELINE_PATH = path.join(REPO_ROOT, 'test', 'fixtures', 'parity-baseline-v1.53.0.0.json');
|
||||
const BASELINE_PATH = path.join(REPO_ROOT, 'test', 'fixtures', 'parity-baseline-v1.57.7.0.json');
|
||||
|
||||
describe('parity suite vs v1.53.0.0 baseline (gate, free)', () => {
|
||||
describe('parity suite vs v1.57.7.0 baseline (gate, free)', () => {
|
||||
test('baseline exists', () => {
|
||||
expect(fs.existsSync(BASELINE_PATH)).toBe(true);
|
||||
});
|
||||
|
||||
Reference in New Issue
Block a user