arXiv preprint arXiv:2512.08864(2025)

Barrett, S · 2025 · arXiv 2512.08864

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Exploring Systems-Thinking Approaches to Loss of Control Risk

cs.CY · 2026-06-11 · unverdicted · novelty 5.0 · 2 refs

Systems analyses of a frontier-lab AI coding agent scenario using STECA, STPA, and FRAM reveal unverifiable governance loops, ineffective control delays, and gradual safeguard erosion, supporting the addition of systems-level methods to model-focused AI evaluations.

The Emergence of Autonomous Penetration Capabilities in Large Language Model-Powered AI Systems

cs.CR · 2026-06-11 · unverdicted · novelty 5.0

A tiered server benchmark with 300 targets shows current LLMs achieve autonomous penetration success rates of 10.7-69.3% using only general cybersecurity tools and no target-specific knowledge.

citing papers explorer

Showing 2 of 2 citing papers.

Exploring Systems-Thinking Approaches to Loss of Control Risk cs.CY · 2026-06-11 · unverdicted · none · ref 8 · 2 links
Systems analyses of a frontier-lab AI coding agent scenario using STECA, STPA, and FRAM reveal unverifiable governance loops, ineffective control delays, and gradual safeguard erosion, supporting the addition of systems-level methods to model-focused AI evaluations.
The Emergence of Autonomous Penetration Capabilities in Large Language Model-Powered AI Systems cs.CR · 2026-06-11 · unverdicted · none · ref 14
A tiered server benchmark with 300 targets shows current LLMs achieve autonomous penetration success rates of 10.7-69.3% using only general cybersecurity tools and no target-specific knowledge.

arXiv preprint arXiv:2512.08864(2025)

fields

years

verdicts

representative citing papers

citing papers explorer