Beyond the imitation game: Quantifying and extrapolating the capabilities of language models, 2023

Aarohi Srivastava et al · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident, Especially When They are Wrong

cs.CL · 2025-01-16 · unverdicted · novelty 5.0

Reasoning before answering MCQs increases LLM confidence more for incorrect answers and degrades calibration on a 57-subject benchmark across seven models.

citing papers explorer

Showing 1 of 1 citing paper.

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident, Especially When They are Wrong cs.CL · 2025-01-16 · unverdicted · none · ref 9
Reasoning before answering MCQs increases LLM confidence more for incorrect answers and degrades calibration on a 57-subject benchmark across seven models.

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models, 2023

fields

years

verdicts

representative citing papers

citing papers explorer