Frontier LLMs exhibit consistent domain-specific differences in metacognitive monitoring on the MMLU benchmark, with applied and professional knowledge domains showing the highest monitoring accuracy and formal reasoning and natural science the lowest.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Domain-level metacognitive monitoring in frontier LLMs: A 33-model atlas