Unsupervised single-generation confidence calibration for reasoning LLMs via offline self-consistency proxy distillation outperforms baselines on math and QA tasks and improves selective prediction.
Calibrated Selective Classification , publisher =
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.LG 2representative citing papers
Selective Conformal Risk Control combines selective classification with conformal risk control to produce compact prediction sets that meet target coverage and risk levels.
citing papers explorer
-
Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation
Unsupervised single-generation confidence calibration for reasoning LLMs via offline self-consistency proxy distillation outperforms baselines on math and QA tasks and improves selective prediction.
-
Selective Conformal Risk Control
Selective Conformal Risk Control combines selective classification with conformal risk control to produce compact prediction sets that meet target coverage and risk levels.