Inference Suboptimality in Variational Autoencoders

Chris Cremer; David Duvenaud; Xuechen Li

arxiv: 1801.03558 · v3 · pith:IAVBEVMCnew · submitted 2018-01-10 · 💻 cs.LG · stat.ML

Inference Suboptimality in Variational Autoencoders

Chris Cremer , Xuechen Li , David Duvenaud This is my paper

classification 💻 cs.LG stat.ML

keywords inferencevariationalapproximationapproximateautoencoderscomplexitydistributionfactors

0 comments

read the original abstract

Amortized inference allows latent-variable models trained via variational learning to scale to large datasets. The quality of approximate inference is determined by two factors: a) the capacity of the variational distribution to match the true posterior and b) the ability of the recognition network to produce good variational parameters for each datapoint. We examine approximate inference in variational autoencoders in terms of these factors. We find that divergence from the true posterior is often due to imperfect recognition networks, rather than the limited complexity of the approximating distribution. We show that this is due partly to the generator learning to accommodate the choice of approximation. Furthermore, we show that the parameters used to increase the expressiveness of the approximation play a role in generalizing inference rather than simply improving the complexity of the approximation.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Tessellations of Semi-Discrete Flow Matching
cs.LG 2026-05 unverdicted novelty 7.0

Semi-discrete Flow Matching produces terminal assignment regions that are topologically simple (open, simply connected, homeomorphic to the ball under assumption) yet geometrically distinct from optimal transport Lagu...
Bayesian Neural Networks: An Introduction and Survey
stat.ML 2020-06 unverdicted novelty 1.0

A survey introducing Bayesian Neural Networks and comparing approximate inference methods to enable uncertainty quantification in neural network predictions.