pith. machine review for the scientific record. sign in

arxiv: 1811.02091 · v2 · pith:2DCZZUQZnew · submitted 2018-11-05 · 📊 stat.ML · cs.LG· cs.PL

Simple, Distributed, and Accelerated Probabilistic Programming

classification 📊 stat.ML cs.LGcs.PL
keywords probabilisticprogrammingapproachimagenutssimplespeeduptpuv2s
0
0 comments X
read the original abstract

We describe a simple, low-level approach for embedding probabilistic programming in a deep learning ecosystem. In particular, we distill probabilistic programming down to a single abstraction---the random variable. Our lightweight implementation in TensorFlow enables numerous applications: a model-parallel variational auto-encoder (VAE) with 2nd-generation tensor processing units (TPUv2s); a data-parallel autoregressive model (Image Transformer) with TPUv2s; and multi-GPU No-U-Turn Sampler (NUTS). For both a state-of-the-art VAE on 64x64 ImageNet and Image Transformer on 256x256 CelebA-HQ, our approach achieves an optimal linear speedup from 1 to 256 TPUv2 chips. With NUTS, we see a 100x speedup on GPUs over Stan and 37x over PyMC3.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.