Review history

arxiv: 2605.09027 · 2 revisions

GAMBIT: A Three-Mode Benchmark for Adversarial Robustness in Multi-Agent LLM Collectives

  1. 2026-05-14 CONDITIONAL LOW v0.9.0 novelty 7.0
    36902 ms 5639 in 1492 out 2026-05-14T21:05:26.707590+00:00
  2. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 8.0
    109726 ms 5639 in 1344 out 2026-05-12T02:46:12.540926+00:00