ARENA: An Architecture for Measuring the Transferability of Autonomous Cyber Defense

\'Agney Lopes Roth Ferraz; Gioliano de Oliveira Braga; Henrique Curi de Miranda; Louren\c{c}o Alves Pereira J\'unior; Sidnei Barbieri; Wagner Comin Sonaglio

arxiv: 2606.21377 · v1 · pith:DOGFLU3Ynew · submitted 2026-06-19 · 💻 cs.CR

ARENA: An Architecture for Measuring the Transferability of Autonomous Cyber Defense

Sidnei Barbieri , \'Agney Lopes Roth Ferraz , Wagner Comin Sonaglio , Gioliano de Oliveira Braga , Henrique Curi de Miranda , Louren\c{c}o Alves Pereira J\'unior This is my paper

classification 💻 cs.CR

keywords boundaryproductiondataevidencefailsresearchresultsecurity

0 comments

read the original abstract

Operational evidence is not automatically scientific evidence. The most realistic Security Operations Center (SOC) data is production telemetry, yet it remains scientifically inaccessible because raw logs cannot be released; as a result, research relies on synthetic or dated datasets. We treat the boundary between private production telemetry and reusable research artifacts as the design object: a methodology that extracts, anonymizes, structures, and validates Security Information and Event Management (SIEM) data from a production financial SOC while preserving task-relevant investigative structure within a declared privacy boundary. Two consumers stress the same artifact. As training material, it fails loudly: 37 MITRE ATT&CK-mapped HIKARI challenges work only when anonymization preserves temporal order and entity consistency. As a measurement substrate, it fails quietly: across 200 SOCpilot incidents, a deterministic verifier detects non-compliant Large Language Model (LLM) actions that are absent from the human baseline. The result is a measurable privacy-utility boundary rather than a formal anonymity claim.

This paper has not been read by Pith yet.

ARENA: An Architecture for Measuring the Transferability of Autonomous Cyber Defense

discussion (0)