Accurate Uncertainties for Deep Learning Using Calibrated Regression

Nathan Fenner; Stefano Ermon; Volodymyr Kuleshov

arxiv: 1807.00263 · v1 · pith:ZK4C6XOPnew · submitted 2018-07-01 · 💻 cs.LG · stat.ML

keywords bayesian · uncertainty · learning · regression · accurate · calibrated · credible · estimates

Abstract

Methods for reasoning under uncertainty are a key building block of accurate and reliable machine learning systems. Bayesian methods provide a general framework to quantify uncertainty. However, because of model misspecification and the use of approximate inference, Bayesian uncertainty estimates are often inaccurate -- for example, a 90% credible interval may not contain the true outcome 90% of the time. Here, we propose a simple procedure for calibrating any regression algorithm; when applied to Bayesian and probabilistic models, it is guaranteed to produce calibrated uncertainty estimates given enough data. Our procedure is inspired by Platt scaling and extends previous work on classification. We evaluate this approach on Bayesian linear regression, feedforward, and recurrent neural networks, and find that it consistently outputs well-calibrated credible intervals while improving performance on time series forecasting and model-based reinforcement learning tasks.
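The calibration idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes Gaussian predictive distributions, uses synthetic data from a deliberately overconfident model, and takes the empirical CDF of the predicted quantile levels as the monotone recalibration map (one simple choice; the abstract only says the procedure is inspired by Platt scaling). The function names `pit_values`, `fit_recalibrator`, and `coverage` are hypothetical.

```python
import bisect
import random
from statistics import NormalDist


def pit_values(means, stds, ys):
    """Evaluate each predicted Gaussian CDF at the observed outcome."""
    return [NormalDist(m, s).cdf(y) for m, s, y in zip(means, stds, ys)]


def fit_recalibrator(pits):
    """Return R(p): the fraction of calibration points whose predicted
    quantile level is at most p (an empirical CDF, hence monotone)."""
    sorted_pits = sorted(pits)
    n = len(sorted_pits)

    def recalibrate(p):
        return bisect.bisect_right(sorted_pits, p) / n

    return recalibrate


def coverage(pits, level):
    """Fraction of outcomes inside the central interval at `level`."""
    lo, hi = (1 - level) / 2, (1 + level) / 2
    return sum(lo <= p <= hi for p in pits) / len(pits)


rng = random.Random(0)


def simulate(n):
    # Assumed toy setup: outcomes have std 2.0, but the model claims
    # std 1.0, so its credible intervals are too narrow.
    ys = [rng.gauss(0.0, 2.0) for _ in range(n)]
    return [0.0] * n, [1.0] * n, ys


cal_pits = pit_values(*simulate(2000))    # held-out calibration set
test_pits = pit_values(*simulate(2000))   # separate evaluation set

recalibrate = fit_recalibrator(cal_pits)
raw_cov = coverage(test_pits, 0.90)
cal_cov = coverage([recalibrate(p) for p in test_pits], 0.90)

print(f"90% interval coverage before recalibration: {raw_cov:.3f}")
print(f"90% interval coverage after recalibration:  {cal_cov:.3f}")
```

In this toy setup the uncalibrated 90% interval covers far fewer than 90% of outcomes, while the recalibrated interval should land close to the nominal level. Fitting the map on data separate from the evaluation set keeps the coverage check honest.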

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Unsolved Problems in ML Safety
cs.LG · 2021-09 · accept · novelty 6.0

The paper presents a roadmap that identifies four unsolved problems in ML safety: robustness against hazards, monitoring for hazards, alignment of model goals with human intent, and systemic safety.

Continuous ageing trajectory representations for knee-aware lifetime prediction of lithium-ion batteries across heterogeneous dataset
cs.LG · 2026-04 · unverdicted · novelty 5.0

Continuous trajectory representations of lithium-ion battery aging enable consistent knee-point detection and early remaining useful life predictions that remain robust across heterogeneous datasets.

Bayesian Neural Networks: An Introduction and Survey
stat.ML · 2020-06 · unverdicted · novelty 1.0

A survey introducing Bayesian Neural Networks and comparing approximate inference methods to enable uncertainty quantification in neural network predictions.