Mirror of https://github.com/romovpa/claudini.git
Synced 2026-04-22 18:15:58 +02:00
Replace teaser with Pareto front evolution plot
Assisted-by: Claude <noreply@anthropic.com>
@@ -5,7 +5,7 @@
 [](https://arxiv.org/abs/2603.24511)

 <p align="center">
-<img src="assets/teaser.png" width="90%" alt="Claude autoresearch vs Optuna hyperparameter search: best train and validation loss over trials">
+<img src="assets/pareto_evolution_small.png" width="90%" alt="Pareto front with evolution annotations">
 </p>

 We show that an *[autoresearch](https://github.com/karpathy/autoresearch)*-style pipeline powered by Claude Code discovers novel white-box adversarial attack *algorithms* that **significantly outperform** all existing [methods](claudini/methods/original/README.md) in jailbreaking and prompt injection evaluations.
Binary file not shown. After: size 322 KiB.
Binary file not shown. Before: size 273 KiB.