pith. sign in

CTFusion: A CTF-based benchmark for LLM agent evaluation.OpenReview (under review)

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CR 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Dynamic Cyber Ranges

cs.CR · 2026-04-27 · unverdicted · novelty 7.0

Dynamic Cyber Ranges with LLM defender agents reduce attacker success to 0-55% and preserve evaluation headroom as models advance by using comparable capabilities on both sides.

citing papers explorer

Showing 1 of 1 citing paper.

  • Dynamic Cyber Ranges cs.CR · 2026-04-27 · unverdicted · none · ref 23

    Dynamic Cyber Ranges with LLM defender agents reduce attacker success to 0-55% and preserve evaluation headroom as models advance by using comparable capabilities on both sides.