Uncertainty quantification for language models: A suite of black-box, white-box, LLM judge, and ensemble scorers

2025

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren't Worth Training

cs.AI · 2026-05-04 · conditional · novelty 6.0

Average token log-probability provides a zero-shot confidence signal for small LLMs that matches supervised baselines in-distribution and outperforms them out-of-distribution, with a new retrieval-conditional variant improving further at lower latency.

citing papers explorer

Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren't Worth Training

cs.AI · 2026-05-04 · conditional · none · ref 13
