Confidence in the reasoning of large language models

Yudi Pawitan, Chris Holmes · 2024 · arXiv 2412.15296

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident, Especially When They are Wrong

cs.CL · 2025-01-16 · unverdicted · novelty 5.0

Reasoning before answering MCQs increases LLM confidence more for incorrect answers and degrades calibration on a 57-subject benchmark across seven models.

citing papers explorer

Showing 1 of 1 citing paper.

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident, Especially When They are Wrong cs.CL · 2025-01-16 · unverdicted · none · ref 12
Reasoning before answering MCQs increases LLM confidence more for incorrect answers and degrades calibration on a 57-subject benchmark across seven models.

Confidence in the reasoning of large language models

fields

years

verdicts

representative citing papers

citing papers explorer