From 1f2d353018b1a1a90db41edbf90669e2fc5dfb2a Mon Sep 17 00:00:00 2001 From: Garry Tan Date: Tue, 24 Mar 2026 08:18:27 -0700 Subject: [PATCH] chore: bump version and changelog (v0.11.16.0) Co-Authored-By: Claude Opus 4.6 --- CHANGELOG.md | 15 +++++++++++++++ VERSION | 2 +- package.json | 2 +- 3 files changed, 17 insertions(+), 2 deletions(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index 4a7ff0d9..ed2a18db 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,5 +1,20 @@ # Changelog +## [0.11.16.0] - 2026-03-24 — Smarter CI: 2-Tier E2E Test System + +### Changed + +- **CI runs only gate tests by default — periodic tests run weekly.** Every E2E test is now classified as `gate` (blocks PRs) or `periodic` (weekly cron + on-demand). Gate tests cover functional correctness and safety guardrails. Periodic tests cover expensive Opus quality benchmarks, non-deterministic routing tests, and tests requiring external services (Codex, Gemini). CI feedback is faster and cheaper while quality benchmarks still run weekly. +- **Global touchfiles are now granular.** Previously, changing `gen-skill-docs.ts` triggered all 56 E2E tests. Now only the ~27 tests that actually depend on it run. Same for `llm-judge.ts`, `test-server.ts`, `worktree.ts`, and the Codex/Gemini session runners. The truly global list is down to 3 files (session-runner, eval-store, touchfiles.ts itself). +- **New `test:gate` and `test:periodic` scripts** replace `test:e2e:fast`. Use `EVALS_TIER=gate` or `EVALS_TIER=periodic` to filter tests by tier. + +### For contributors + +- `E2E_TIERS` map in `test/helpers/touchfiles.ts` classifies every test — a free validation test ensures it stays in sync with `E2E_TOUCHFILES` +- `EVALS_FAST` / `FAST_EXCLUDED_TESTS` removed in favor of `EVALS_TIER` +- `allow_failure` removed from CI matrix (gate tests should be reliable) +- New `.github/workflows/evals-periodic.yml` runs periodic tests Monday 6 AM UTC + ## [0.11.15.0] - 2026-03-24 — E2E Test Coverage for Plan Reviews & Codex ### Added diff --git a/VERSION b/VERSION index 446cced3..e36c939e 100644 --- a/VERSION +++ b/VERSION @@ -1 +1 @@ -0.11.15.0 +0.11.16.0 diff --git a/package.json b/package.json index 7ba726fe..0130b351 100644 --- a/package.json +++ b/package.json @@ -1,6 +1,6 @@ { "name": "gstack", - "version": "0.11.15.0", + "version": "0.11.16.0", "description": "Garry's Stack — Claude Code skills + fast headless browser. One repo, one install, entire AI engineering workflow.", "license": "MIT", "type": "module",