Input includes report context and a question with options; output is a JSON object, wrapped in <json object> tags, containing a list of correct answers.
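
The output format described above can be illustrated with a small parser. This is a minimal sketch rather than the benchmark's own code: the source does not give the exact tag spelling or the JSON key, so the tag name `json_object` and the key `correct_answers` are hypothetical placeholders, as is the function name `extract_answers`.

```python
import json
import re

# Assumed names: the source does not specify the exact tag or key,
# so both values below are illustrative placeholders.
TAG = "json_object"
KEY = "correct_answers"


def extract_answers(model_output: str, tag: str = TAG, key: str = KEY) -> list:
    """Return the answers list from the first <tag>...</tag> block in the text."""
    # Non-greedy match so only the first tagged block is captured.
    pattern = re.compile(
        rf"<{re.escape(tag)}>(.*?)</{re.escape(tag)}>", re.DOTALL
    )
    match = pattern.search(model_output)
    if match is None:
        raise ValueError(f"no <{tag}> block found")

    # json.loads raises json.JSONDecodeError (a ValueError) on malformed JSON.
    payload = json.loads(match.group(1))
    if not isinstance(payload, dict):
        raise ValueError("tagged block does not contain a JSON object")

    answers = payload.get(key)
    if not isinstance(answers, list):
        raise ValueError(f"'{key}' is missing or not a list")
    return answers


if __name__ == "__main__":
    sample = 'Some reasoning. <json_object>{"correct_answers": ["A", "C"]}</json_object>'
    print(extract_answers(sample))
```

Rejecting output that lacks the tagged block or the list, instead of guessing, keeps a malformed response from being scored as a valid answer.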

SOCEval (Deason et al., 2025)

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

cs.LG · 2026-01-31 · unverdicted · novelty 7.0

MinervaRL applies reinforcement learning with verifiable rewards from CTI standards to improve LLM structured output performance by 15.8 points over base models across 12 benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs cs.LG · 2026-01-31 · unverdicted · none · ref 31
