Input is a prompt with five options (A–E); output is a single letter A–E on the final line (optional brief justification allowed)

CKT (Alam et al., 2025)
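The output format described above (a single letter A–E on the final line, with an optional brief justification) could be checked with a small parser. The following is a minimal sketch, not the benchmark's own scoring code: the function name `extract_answer` is hypothetical, and it assumes that any justification comes before the final line and that the final non-empty line holds the bare letter and nothing else.

```python
from typing import Optional


def extract_answer(output: str) -> Optional[str]:
    """Return the answer letter from the final non-empty line, or None.

    Assumption (not stated in the source): the justification, if any,
    precedes the answer, and the last non-empty line is exactly one
    letter in the range A-E.
    """
    # Drop blank lines so trailing newlines do not hide the answer line.
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    if not lines:
        return None
    last = lines[-1]
    # Accept only a bare single letter among the five options.
    if len(last) == 1 and last in "ABCDE":
        return last
    return None
```

With these assumptions, a response such as `"Option B matches the technique.\nB"` yields `B`, while a response whose last line is a full sentence yields `None` and would be treated as malformed.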

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

cs.LG · 2026-01-31 · unverdicted · novelty 7.0 · ref 29

MinervaRL applies reinforcement learning with verifiable rewards from CTI standards to improve LLM structured output performance by 15.8 points over base models across 12 benchmarks.
