pith. sign in

arxiv: 1809.10330 · v3 · pith:QYT7NUDNnew · submitted 2018-09-27 · 📊 stat.ML · cs.LG· stat.CO

Variance reduction properties of the reparameterization trick

classification 📊 stat.ML cs.LGstat.CO
keywords reparameterizationvariationalfunctiongradienttrickdensityestimatoridealized
0
0 comments X
read the original abstract

The reparameterization trick is widely used in variational inference as it yields more accurate estimates of the gradient of the variational objective than alternative approaches such as the score function method. Although there is overwhelming empirical evidence in the literature showing its success, there is relatively little research exploring why the reparameterization trick is so effective. We explore this under the idealized assumptions that the variational approximation is a mean-field Gaussian density and that the log of the joint density of the model parameters and the data is a quadratic function that depends on the variational mean. From this, we show that the marginal variances of the reparameterization gradient estimator are smaller than those of the score function gradient estimator. We apply the result of our idealized analysis to real-world examples.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.