Beyond Rewards in Reinforcement Learning for Cyber Defence

Bates et al. · 2026

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Measuring Security Without Fooling Ourselves: Why Benchmarking Agents Is Hard

cs.CR · 2026-05-21 · unverdicted · novelty 4.0

This paper characterizes three challenges—benchmark vulnerabilities, temporal staleness, and runtime uncertainty—that undermine security evaluations of AI agents and outlines directions for more robust frameworks.

citing papers explorer

Showing 1 of 1 citing paper.

Measuring Security Without Fooling Ourselves: Why Benchmarking Agents Is Hard

cs.CR · 2026-05-21 · unverdicted · none · ref 12
