Commit Graph

  • e34fd126f9 Add claudini.asr: compute ASR from benchmark results fix/safeguard-valid-skip-broken-samples Alexander Panfilov 2026-04-10 22:11:01 +02:00
  • 6e170551a2 safeguard_valid: drop 10 samples that trip gpt-oss attention bug Alexander Panfilov 2026-04-10 21:43:23 +02:00
  • 238c702b06 Default filter_ascii to False to match legacy benchmark behavior (#3) main Alexander Panfilov 2026-04-10 13:53:34 +03:00
  • 8221f0afb8 Add leaderboard generation script (#2) v0.1 Alexander Panfilov 2026-04-06 18:55:32 +03:00
  • 59106bdf3c SecAlign-70B support: configs, quantization, multi-GPU (#1) Peter Romov 2026-04-06 13:36:08 +00:00
  • 48eb0f155c Replace teaser with Pareto front evolution plot Peter Romov 2026-03-27 00:20:39 +00:00
  • 99e80657a5 Remove empty claude_demo method directory Peter Romov 2026-03-26 17:37:18 +00:00
  • 69c04a2b9e Add autoresearch skill, update configs and README Peter Romov 2026-03-26 16:28:14 +00:00
  • 4c938fd325 Update arxiv link and citation (arXiv:2603.24511) Peter Romov 2026-03-26 10:05:29 +00:00
  • 63974ddfee Add unrolled methods (claude_oss_v53, claude_v63) with README Peter Romov 2026-03-25 18:27:52 +00:00
  • 5b6058b3c4 Initial commit Peter Romov 2026-03-25 02:09:26 +00:00