International Conference on Machine Learning , pages=

On calibration of modern neural networks , author= · 2017

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

representative citing papers

Risk-Controlled Post-Processing of Decision Policies

stat.ML · 2026-05-07 · unverdicted · novelty 7.0

Risk-controlled post-processing yields a threshold-structured policy that follows the baseline except where an oracle fallback sharply reduces conditional violation risk, achieving O(log n/n) expected excess risk in i.i.d. settings and exact risk control under exchangeability.

LLMs as Implicit Imputers: Uncertainty Should Scale with Missing Information

stat.ML · 2026-05-13 · unverdicted · novelty 6.0

Response entropy in LLMs rises with missing context on SQuAD while sampling-based confidence stays high, supporting the multiple imputation criterion and introducing a diagnostic for uncertainty reduction by context level.

Selecting Informative Conformal Prediction Sets with an Optimized FCR-Controlled Approach

stat.ME · 2026-05-21 · unverdicted · novelty 5.0

An oracle-optimal decision policy for informative conformal prediction sets is calibrated to ensure finite-sample FCR control and delivers higher power than prior methods on classification tasks.

Calibrating Model-Based Evaluation Metrics for Summarization

cs.CL · 2026-04-19 · unverdicted · novelty 5.0

A reference-free proxy scoring framework combined with GIRB calibration produces better-aligned evaluation metrics for summarization and outperforms baselines across seven datasets.

citing papers explorer

Showing 4 of 4 citing papers.

Risk-Controlled Post-Processing of Decision Policies stat.ML · 2026-05-07 · unverdicted · none · ref 200
Risk-controlled post-processing yields a threshold-structured policy that follows the baseline except where an oracle fallback sharply reduces conditional violation risk, achieving O(log n/n) expected excess risk in i.i.d. settings and exact risk control under exchangeability.
LLMs as Implicit Imputers: Uncertainty Should Scale with Missing Information stat.ML · 2026-05-13 · unverdicted · none · ref 2
Response entropy in LLMs rises with missing context on SQuAD while sampling-based confidence stays high, supporting the multiple imputation criterion and introducing a diagnostic for uncertainty reduction by context level.
Selecting Informative Conformal Prediction Sets with an Optimized FCR-Controlled Approach stat.ME · 2026-05-21 · unverdicted · none · ref 8
An oracle-optimal decision policy for informative conformal prediction sets is calibrated to ensure finite-sample FCR control and delivers higher power than prior methods on classification tasks.
Calibrating Model-Based Evaluation Metrics for Summarization cs.CL · 2026-04-19 · unverdicted · none · ref 127
A reference-free proxy scoring framework combined with GIRB calibration produces better-aligned evaluation metrics for summarization and outperforms baselines across seven datasets.

International Conference on Machine Learning , pages=

fields

years

verdicts

representative citing papers

citing papers explorer