1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals

cs.CL · 2026-05-22 · unverdicted · novelty 6.0

MaR is a metacognition-inspired RL method that rewards LLMs on task knowledge, regulation fidelity, and answer correctness, reporting gains up to 11% over baselines on 22 benchmarks.
