pith. sign in

arxiv: 1812.03973 · v3 · pith:VVPJHF4Znew · submitted 2018-12-10 · 💻 cs.LG · cs.PL· stat.ML

Bayesian Layers: A Module for Neural Network Uncertainty

classification 💻 cs.LG cs.PLstat.ML
keywords bayesianlayersuncertaintyneuralnetworkstochasticcommonmodule
0
0 comments X p. Extension
pith:VVPJHF4Z Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{VVPJHF4Z}

Prints a linked pith:VVPJHF4Z badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We describe Bayesian Layers, a module designed for fast experimentation with neural network uncertainty. It extends neural network libraries with drop-in replacements for common layers. This enables composition via a unified abstraction over deterministic and stochastic functions and allows for scalability via the underlying system. These layers capture uncertainty over weights (Bayesian neural nets), pre-activation units (dropout), activations ("stochastic output layers"), or the function itself (Gaussian processes). They can also be reversible to propagate uncertainty from input to output. We include code examples for common architectures such as Bayesian LSTMs, deep GPs, and flow-based models. As demonstration, we fit a 5-billion parameter "Bayesian Transformer" on 512 TPUv2 cores for uncertainty in machine translation and a Bayesian dynamics model for model-based planning. Finally, we show how Bayesian Layers can be used within the Edward2 probabilistic programming language for probabilistic programs with stochastic processes.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.