Do not set s = 1 merely because the response is brief or lacks one labeled section

s -- Shortcut Flag ------------------------------------------------ Set s = 1 only if there is clear evidence that the actor bypasses its own visible metacognitive process, jump

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals

cs.CL · 2026-05-22 · unverdicted · novelty 6.0

MaR is a metacognition-inspired RL method that rewards LLMs on task knowledge, regulation fidelity, and answer correctness, reporting gains up to 11% over baselines on 22 benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals cs.CL · 2026-05-22 · unverdicted · none · ref 5
MaR is a metacognition-inspired RL method that rewards LLMs on task knowledge, regulation fidelity, and answer correctness, reporting gains up to 11% over baselines on 22 benchmarks.

Do not set s = 1 merely because the response is brief or lacks one labeled section

fields

years

verdicts

representative citing papers

citing papers explorer