From f3e6b7e316e66c50665fa7131a87ad307119f278 Mon Sep 17 00:00:00 2001 From: Garry Tan Date: Tue, 17 Mar 2026 22:17:31 -0700 Subject: [PATCH] docs: add ## Testing section to CLAUDE.md for /ship discovery Dedicated section with the two commands /ship needs: bun test (free, every commit) and bun run test:evals (paid, before shipping). This is the section /ship Step 3 reads to discover what tests to run. Co-Authored-By: Claude Opus 4.6 (1M context) --- CLAUDE.md | 11 +++++++++++ 1 file changed, 11 insertions(+) diff --git a/CLAUDE.md b/CLAUDE.md index 769756f6..5c5a550f 100644 --- a/CLAUDE.md +++ b/CLAUDE.md @@ -30,6 +30,17 @@ on `git diff` against the base branch. Each test declares its file dependencies llm-judge, gen-skill-docs) trigger all tests. Use `EVALS_ALL=1` or the `:all` script variants to force all tests. Run `eval:select` to preview which tests would run. +## Testing + +```bash +bun test # run before every commit — free, <2s +bun run test:evals # run before shipping — paid, diff-based (~$4/run max) +``` + +`bun test` runs skill validation, gen-skill-docs quality checks, and browse +integration tests. `bun run test:evals` runs LLM-judge quality evals and E2E +tests via `claude -p`. Both must pass before creating a PR. + ## Project structure ```