Toward cybersecurity-expert small language models

Matan Levi, Daniel Ohayon, Ariel Blobstein, Ravid Sagi, Ian Molloy, Yair Allouche · 2025 · arXiv 2510.14113

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Dynamic Cyber Ranges

cs.CR · 2026-04-27 · unverdicted · novelty 7.0

Dynamic Cyber Ranges with LLM defender agents reduce attacker success to 0-55% and preserve evaluation headroom as models advance by using comparable capabilities on both sides.

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

cs.LG · 2026-01-31 · unverdicted · novelty 7.0

MinervaRL applies reinforcement learning with verifiable rewards from CTI standards to improve LLM structured output performance by 15.8 points over base models across 12 benchmarks.

A Red Teaming Framework for Evaluating Robustness of AI-enabled Security Orchestration, Automation, and Response Systems

cs.CR · 2026-05-16 · unverdicted · novelty 6.0

A hybrid LLM-RL red teaming framework generates adaptive attack campaigns in simulated enterprise networks to evaluate the robustness of AI-enabled SOAR systems.

citing papers explorer

Showing 3 of 3 citing papers.

Dynamic Cyber Ranges · cs.CR · 2026-04-27 · unverdicted · none · ref 65
Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs · cs.LG · 2026-01-31 · unverdicted · none · ref 4
A Red Teaming Framework for Evaluating Robustness of AI-enabled Security Orchestration, Automation, and Response Systems · cs.CR · 2026-05-16 · unverdicted · none · ref 19
