Input describes attacker behavior (Windows environment); output is a single technique IDT####orT####.###on the final line

ATE(Alam et al · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

cs.LG · 2026-01-31 · unverdicted · novelty 7.0

MinervaRL applies reinforcement learning with verifiable rewards from CTI standards to improve LLM structured output performance by 15.8 points over base models across 12 benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs cs.LG · 2026-01-31 · unverdicted · none · ref 34
MinervaRL applies reinforcement learning with verifiable rewards from CTI standards to improve LLM structured output performance by 15.8 points over base models across 12 benchmarks.

Input describes attacker behavior (Windows environment); output is a single technique IDT####orT####.###on the final line

fields

years

verdicts

representative citing papers

citing papers explorer