Replace teaser with Pareto front evolution plot

Assisted-by: Claude <noreply@anthropic.com>
This commit is contained in:
Peter Romov
2026-03-27 00:20:39 +00:00
parent 99e80657a5
commit 48eb0f155c
3 changed files with 1 additions and 1 deletions
+1 -1
View File
@@ -5,7 +5,7 @@
[![arXiv](https://img.shields.io/badge/arXiv-2603.24511-b31b1b.svg?logo=arxiv)](https://arxiv.org/abs/2603.24511)
<p align="center">
<img src="assets/teaser.png" width="90%" alt="Claude autoresearch vs Optuna hyperparameter search: best train and validation loss over trials">
<img src="assets/pareto_evolution_small.png" width="90%" alt="Pareto front with evolution annotations">
</p>
We show that an *[autoresearch](https://github.com/karpathy/autoresearch)*-style pipeline powered by Claude Code discovers novel white-box adversarial attack *algorithms* that **significantly outperform** all existing [methods](claudini/methods/original/README.md) in jailbreaking and prompt injection evaluations.
Binary file not shown.

After

Width:  |  Height:  |  Size: 322 KiB

BIN
View File
Binary file not shown.

Before

Width:  |  Height:  |  Size: 273 KiB