Your pre-trained LLM is secretly an unsupervised confidence calibrator

Beier Luo, Shuoyuan Wang, Sharon Li, Hongxin Wei · 2025 · arXiv 2505.16690

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

Unsupervised single-generation confidence calibration for reasoning LLMs via offline self-consistency proxy distillation outperforms baselines on math and QA tasks and improves selective prediction.

LLMs Should Express Uncertainty Explicitly

cs.LG · 2026-04-07 · unverdicted · novelty 6.0 · 2 refs

Training LLMs to verbalize uncertainty explicitly at the end or during reasoning reduces overconfident errors and improves answer quality on factual tasks while enabling RAG triggers.

Towards Trustworthy Report Generation: A Deep Research Agent with Progressive Confidence Estimation and Calibration

cs.AI · 2026-04-07 · unverdicted · novelty 4.0

A deep research agent incorporates progressive confidence estimation and calibration to produce trustworthy reports with transparent confidence scores on claims.

MARGIN: Runtime Confidence Calibration for Multi-Agent Foundation Model Coordination

cs.LG · 2026-05-21

citing papers explorer

Showing 4 of 4 citing papers.

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation cs.LG · 2026-04-21 · unverdicted · none · ref 177
Unsupervised single-generation confidence calibration for reasoning LLMs via offline self-consistency proxy distillation outperforms baselines on math and QA tasks and improves selective prediction.
LLMs Should Express Uncertainty Explicitly cs.LG · 2026-04-07 · unverdicted · none · ref 5 · 2 links
Training LLMs to verbalize uncertainty explicitly at the end or during reasoning reduces overconfident errors and improves answer quality on factual tasks while enabling RAG triggers.
Towards Trustworthy Report Generation: A Deep Research Agent with Progressive Confidence Estimation and Calibration cs.AI · 2026-04-07 · unverdicted · none · ref 12
A deep research agent incorporates progressive confidence estimation and calibration to produce trustworthy reports with transparent confidence scores on claims.
MARGIN: Runtime Confidence Calibration for Multi-Agent Foundation Model Coordination cs.LG · 2026-05-21 · unreviewed · ref 22

Your pre-trained LLM is secretly an unsupervised confidence calibrator

fields

years

verdicts

representative citing papers

citing papers explorer