Toward cybersecurity-expert small language models

Xiaoxiao Yu et al · 2025 · arXiv 2510.14113

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

cs.CR · 2026-05-27 · unverdicted · novelty 7.0

CAI Dataset is presented as the largest described corpus of LLM-driven hacker trajectories, with the claim that operator data concentration in frontier-model providers creates a major security risk best addressed by on-premise specialized LLMs.

Dynamic Cyber Ranges

cs.CR · 2026-04-27 · unverdicted · novelty 7.0

Dynamic Cyber Ranges with LLM defender agents reduce attacker success to 0-55% and preserve evaluation headroom as models advance by using comparable capabilities on both sides.

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

cs.LG · 2026-01-31 · unverdicted · novelty 7.0

MinervaRL applies reinforcement learning with verifiable rewards from CTI standards to improve LLM structured output performance by 15.8 points over base models across 12 benchmarks.

A Red Teaming Framework for Evaluating Robustness of AI-enabled Security Orchestration, Automation, and Response Systems

cs.CR · 2026-05-16 · unverdicted · novelty 6.0

A hybrid LLM-RL red teaming framework generates adaptive attack campaigns in simulated enterprise networks to evaluate the robustness of AI-enabled SOAR systems.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Cybersecurity AI (CAI) Dataset cs.CR · 2026-05-27 · unverdicted · none · ref 51
CAI Dataset is presented as the largest described corpus of LLM-driven hacker trajectories, with the claim that operator data concentration in frontier-model providers creates a major security risk best addressed by on-premise specialized LLMs.
Dynamic Cyber Ranges cs.CR · 2026-04-27 · unverdicted · none · ref 65
Dynamic Cyber Ranges with LLM defender agents reduce attacker success to 0-55% and preserve evaluation headroom as models advance by using comparable capabilities on both sides.
A Red Teaming Framework for Evaluating Robustness of AI-enabled Security Orchestration, Automation, and Response Systems cs.CR · 2026-05-16 · unverdicted · none · ref 19
A hybrid LLM-RL red teaming framework generates adaptive attack campaigns in simulated enterprise networks to evaluate the robustness of AI-enabled SOAR systems.

Toward cybersecurity-expert small language models

fields

years

verdicts

representative citing papers

citing papers explorer