diff --git a/README.md b/README.md index 0ae7c38..b8df920 100644 --- a/README.md +++ b/README.md @@ -19,6 +19,7 @@ This is a curated list of AI-powered developer tools. These tools leverage AI to - [OpenAI plugins](#openai-plugins) - [Search](#search) - [Testing](#testing) +- [Evaluation](#evaluation) ## IDEs - [Google Antigravity](https://antigravity.google/) — An agent-first IDE that orchestrates autonomous AI agents to plan, execute, and verify complex coding tasks with deep browser integration. @@ -253,4 +254,7 @@ This is a curated list of AI-powered developer tools. These tools leverage AI to - [Test Gru](https://gru.ai/home#test-gru) — Provides enterprise-level unit test automation services. - [AgentsKB](https://agentskb.com) - Expert knowledge layer for AI assistants. Your AI searches, we research. That's the difference. + +## Evaluation + - [sniffbench](https://github.com/AnswerLayer/sniffbench) — Benchmark suite for evaluating coding agents. Compare configurations, track metrics, and A/B test with real issues from your repos.