Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference

Yarin Gal; Zoubin Ghahramani

arxiv: 1506.02158 · v6 · pith:YJQDLFIOnew · submitted 2015-06-06 · 📊 stat.ML · cs.LG

Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference

Yarin Gal , Zoubin Ghahramani This is my paper

classification 📊 stat.ML cs.LG

keywords dataapproximatebayesiancnnsmodelnetworksneuralbernoulli

0 comments

read the original abstract

Convolutional neural networks (CNNs) work well on large datasets. But labelled data is hard to collect, and in some applications larger amounts of data are not available. The problem then is how to use CNNs with small data -- as CNNs overfit quickly. We present an efficient Bayesian CNN, offering better robustness to over-fitting on small data than traditional approaches. This is by placing a probability distribution over the CNN's kernels. We approximate our model's intractable posterior with Bernoulli variational distributions, requiring no additional model parameters. On the theoretical side, we cast dropout network training as approximate inference in Bayesian neural networks. This allows us to implement our model using existing tools in deep learning with no increase in time complexity, while highlighting a negative result in the field. We show a considerable improvement in classification accuracy compared to standard techniques and improve on published state-of-the-art results for CIFAR-10.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

U-FaceBP: Uncertainty-aware Bayesian Ensemble Deep Learning for Face Video-based Blood Pressure Estimation
cs.CV 2024-12 unverdicted novelty 6.0

U-FaceBP combines multiple Bayesian neural networks in an ensemble to estimate blood pressure from face video modalities while quantifying uncertainty, showing improved performance on datasets with 1197 diverse subjects.
Unsupervised Domain Adaptation via Calibrating Uncertainties
cs.LG 2019-07 unverdicted novelty 6.0

A new regularization approach for unsupervised domain adaptation that calibrates Renyi entropy of uncertainties estimated via variational Bayes.
Are Candidate Models Really Needed for Active Learning?
cs.CV 2026-05 unverdicted novelty 5.0

Active learning with randomly initialized models achieves comparable results to traditional candidate-model methods, with low-confidence sampling proving most effective.
Introduction to Camera Pose Estimation with Deep Learning
cs.CV 2019-07 unverdicted novelty 2.0

A survey of deep learning approaches for regressing absolute camera pose from single RGB images, covering key methods, trends, cross-comparisons, and reproducibility notes.