pith. sign in

arxiv: 1805.11659 · v2 · pith:MSVM7BNYnew · submitted 2018-05-29 · 📊 stat.ML · cs.LG

A Unified Particle-Optimization Framework for Scalable Bayesian Sampling

classification 📊 stat.ML cs.LG
keywords sg-mcmcframeworksvgdbayesiangradientsamplessamplingscalable
0
0 comments X
read the original abstract

There has been recent interest in developing scalable Bayesian sampling methods such as stochastic gradient MCMC (SG-MCMC) and Stein variational gradient descent (SVGD) for big-data analysis. A standard SG-MCMC algorithm simulates samples from a discrete-time Markov chain to approximate a target distribution, thus samples could be highly correlated, an undesired property for SG-MCMC. In contrary, SVGD directly optimizes a set of particles to approximate a target distribution, and thus is able to obtain good approximations with relatively much fewer samples. In this paper, we propose a principle particle-optimization framework based on Wasserstein gradient flows to unify SG-MCMC and SVGD, and to allow new algorithms to be developed. Our framework interprets SG-MCMC as particle optimization on the space of probability measures, revealing a strong connection between SG-MCMC and SVGD. The key component of our framework is several particle-approximate techniques to efficiently solve the original partial differential equations on the space of probability measures. Extensive experiments on both synthetic data and deep neural networks demonstrate the effectiveness and efficiency of our framework for scalable Bayesian sampling.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Particle-based Energetic Variational Inference

    stat.ML 2020-04 unverdicted novelty 7.0

    Energetic variational inference derives existing particle-based VI methods from energy-dissipation laws and proposes an approximation-then-variation scheme that preserves particle-level structure while reducing KL divergence.