diff --git a/ship/SKILL.md b/ship/SKILL.md
index ce61d03f..56c4a197 100644
--- a/ship/SKILL.md
+++ b/ship/SKILL.md
@@ -406,9 +406,9 @@ Only commit if there are changes. Stage all bootstrap files (config, test direct
## Step 3: Run tests (on merged code)
-Read CLAUDE.md. Look for a `## Testing` section or test commands in `## Commands`.
-Extract all test commands (lines in bash code blocks that run tests — e.g., `bun test`,
-`npm test`, `pytest`, `go test ./...`, `cargo test`, `bin/rails test`).
+Read CLAUDE.md. Look for a section about testing — `## Testing`, `## Test Commands`,
+`## Tests`, or test commands in `## Commands`. Extract all test commands (lines in
+bash code blocks or command tables that run tests — e.g., `bun test`, `npm test`, `pytest`, `go test ./...`, `cargo test`, `bin/rails test`).
Run all discovered test commands in parallel, each piped to a unique /tmp file:
@@ -448,8 +448,9 @@ If C: Skip tests with warning. Continue to Step 3.25.
## Step 3.25: Eval Suites (conditional)
-Read CLAUDE.md. Look for a `## Evals` section, or eval-related commands in `## Testing`
-or `## Commands` (identified by keywords: "eval", "evals", "judge", "llm-judge").
+Read CLAUDE.md. Look for a `## Evals` section, or eval-related commands in `## Testing`,
+`## Test Commands`, `## Tests`, or `## Commands` (identified by keywords: "eval", "evals",
+"judge", "llm-judge").
If an eval command is found:
Run it. The project's eval system handles diff-based file selection internally.
diff --git a/ship/SKILL.md.tmpl b/ship/SKILL.md.tmpl
index f7247b2c..5540ee2c 100644
--- a/ship/SKILL.md.tmpl
+++ b/ship/SKILL.md.tmpl
@@ -101,9 +101,9 @@ git fetch origin && git merge origin/ --no-edit
## Step 3: Run tests (on merged code)
-Read CLAUDE.md. Look for a `## Testing` section or test commands in `## Commands`.
-Extract all test commands (lines in bash code blocks that run tests — e.g., `bun test`,
-`npm test`, `pytest`, `go test ./...`, `cargo test`, `bin/rails test`).
+Read CLAUDE.md. Look for a section about testing — `## Testing`, `## Test Commands`,
+`## Tests`, or test commands in `## Commands`. Extract all test commands (lines in
+bash code blocks or command tables that run tests — e.g., `bun test`, `npm test`, `pytest`, `go test ./...`, `cargo test`, `bin/rails test`).
Run all discovered test commands in parallel, each piped to a unique /tmp file:
@@ -143,8 +143,9 @@ If C: Skip tests with warning. Continue to Step 3.25.
## Step 3.25: Eval Suites (conditional)
-Read CLAUDE.md. Look for a `## Evals` section, or eval-related commands in `## Testing`
-or `## Commands` (identified by keywords: "eval", "evals", "judge", "llm-judge").
+Read CLAUDE.md. Look for a `## Evals` section, or eval-related commands in `## Testing`,
+`## Test Commands`, `## Tests`, or `## Commands` (identified by keywords: "eval", "evals",
+"judge", "llm-judge").
If an eval command is found:
Run it. The project's eval system handles diff-based file selection internally.