Reasoning-finetuning repurposes latent representations in base models.arXiv:2507.12638

Jake Ward, Chuqiao Lin, Constantin Venhoff, Neel Nanda · arXiv 2507.12638

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

LLMs implement a second-order confidence architecture where the PANL activation encodes both error likelihood and the ability to correct it, beyond verbal confidence or log-probabilities.

citing papers explorer

Showing 1 of 1 citing paper.

How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals cs.LG · 2026-04-24 · unverdicted · none · ref 24
LLMs implement a second-order confidence architecture where the PANL activation encodes both error likelihood and the ability to correct it, beyond verbal confidence or log-probabilities.

Reasoning-finetuning repurposes latent representations in base models.arXiv:2507.12638

fields

years

verdicts

representative citing papers

citing papers explorer