docs: update README, CONTRIBUTING, ARCHITECTURE for v0.3.6

Update test tier costs and commands (Agent SDK → claude -p, SKILL_E2E → EVALS),
add E2E observability section to CONTRIBUTING and ARCHITECTURE, add testing
quick-start to README.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Garry Tan
2026-03-14 12:47:00 -05:00
parent 4ace0c2f6f
commit 43fbe165a4
3 changed files with 139 additions and 23 deletions
+11 -1
@@ -619,7 +619,17 @@ Paste this into Claude Code:
## Development
See [BROWSER.md](BROWSER.md) for the full development guide, architecture, and command reference.
See [CONTRIBUTING.md](CONTRIBUTING.md) for setup, testing, and dev mode. See [ARCHITECTURE.md](ARCHITECTURE.md) for design decisions and system internals. See [BROWSER.md](BROWSER.md) for the browse command reference.
### Testing
```bash
bun test # free static tests (<5s)
EVALS=1 bun run test:evals # full E2E + LLM evals (~$4, ~20min)
bun run eval:watch # live dashboard during E2E runs
```
E2E tests stream real-time progress, write machine-readable diagnostics, and persist partial results that survive kills. See CONTRIBUTING.md for the full eval infrastructure.
## License