chore: bump version to 0.3.3, update changelog

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-06-18 15:50:11 +02:00 · 2026-03-13 22:35:31 -05:00
parent 9dffb1ed16
commit e6c7348006
3 changed files with 25 additions and 2 deletions
@@ -1,5 +1,28 @@
 # Changelog

+## 0.3.3 — 2026-03-13
+
+### Added
+- **SKILL.md template system** — `.tmpl` files with `{{COMMAND_REFERENCE}}` and `{{SNAPSHOT_FLAGS}}` placeholders, auto-generated from source code at build time. Structurally prevents command drift between docs and code.
+- **Command registry** (`browse/src/commands.ts`) — single source of truth for all browse commands with categories and enriched descriptions. Zero side effects, safe to import from build scripts and tests.
+- **Snapshot flags metadata** (`SNAPSHOT_FLAGS` array in `browse/src/snapshot.ts`) — metadata-driven parser replaces hand-coded switch/case. Adding a flag in one place updates the parser, docs, and tests.
+- **Tier 1 static validation** — 43 tests: parses `$B` commands from SKILL.md code blocks, validates against command registry and snapshot flag metadata
+- **Tier 2 E2E tests** via Agent SDK — spawns real Claude sessions, runs skills, scans for browse errors. Gated by `SKILL_E2E=1` env var (~$0.50/run)
+- **Tier 3 LLM-as-judge evals** — Haiku scores generated docs on clarity/completeness/actionability (threshold ≥4/5), plus regression test vs hand-maintained baseline. Gated by `ANTHROPIC_API_KEY`
+- **`bun run skill:check`** — health dashboard showing all skills, command counts, validation status, template freshness
+- **`bun run dev:skill`** — watch mode that regenerates and validates SKILL.md on every template or source file change
+- **CI workflow** (`.github/workflows/skill-docs.yml`) — runs `gen:skill-docs` on push/PR, fails if generated output differs from committed files
+- `bun run gen:skill-docs` script for manual regeneration
+- `bun run test:eval` for LLM-as-judge evals
+- `test/helpers/skill-parser.ts` — extracts and validates `$B` commands from Markdown
+- `test/helpers/session-runner.ts` — Agent SDK wrapper with error pattern scanning and transcript saving
+
+### Changed
+- Build now runs `gen:skill-docs` before compiling binaries
+- `parseSnapshotArgs` is metadata-driven (iterates `SNAPSHOT_FLAGS` instead of switch/case)
+- `server.ts` imports command sets from `commands.ts` instead of declaring inline
+- SKILL.md and browse/SKILL.md are now generated files (edit the `.tmpl` instead)
+
 ## 0.3.2 — 2026-03-13

 ### Fixed
@@ -1 +1 @@
-0.3.2
+0.3.3
@@ -1,6 +1,6 @@
 {
  "name": "gstack",
-  "version": "0.3.2",
+  "version": "0.3.3",
  "description": "Garry's Stack — Claude Code skills + fast headless browser. One repo, one install, entire AI engineering workflow.",
  "license": "MIT",
  "type": "module",
@@ -1 +1 @@
 .3.2
 .3.3