
arXiv: 1402.0480 · v5 · submitted 2014-02-03 · cs.LG · stat.ML


Efficient Gradient-Based Inference through Transformations between Bayes Nets and Neural Nets

keywords: inference, gradient-based, models, nets, networks, neural, non-centered, often

Hierarchical Bayesian networks and neural networks with stochastic hidden units are commonly perceived as two separate types of models. We show that either of these types of models can often be transformed into an instance of the other by switching between centered and differentiable non-centered parameterizations of the latent variables. The choice of parameterization greatly influences the efficiency of gradient-based posterior inference; we show that the two parameterizations are often complementary to each other, clarify when each is preferred, and show how inference can be made robust. In the non-centered form, a simple Monte Carlo estimator of the marginal likelihood can be used for learning the parameters. Theoretical results are supported by experiments.
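
The two parameterizations named in the abstract can be illustrated with a minimal sketch. This is not code from the paper: it assumes a toy model with a single Gaussian latent variable z ~ N(mu, sigma^2) and a Gaussian observation x | z ~ N(z, tau^2), and every name in it (`sample_centered`, `sample_non_centered`, `mc_marginal_likelihood`, `tau`) is hypothetical. In the centered form z is drawn directly from its conditional distribution; in the non-centered form z is a differentiable function of the parameters and of parameter-free noise. Averaging p(x | z) over the noise then gives a simple Monte Carlo estimate of the marginal likelihood, which in this toy case can be checked against the closed form N(x; mu, sigma^2 + tau^2).

```python
import math
import random


def normal_pdf(x, mean, std):
    """Density of a univariate Gaussian N(mean, std^2) evaluated at x."""
    var = std * std
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def sample_centered(mu, sigma, rng):
    """Centered form: draw z directly from p(z | mu, sigma)."""
    return rng.gauss(mu, sigma)


def sample_non_centered(mu, sigma, rng):
    """Non-centered form: draw parameter-free noise, then transform it."""
    eps = rng.gauss(0.0, 1.0)
    return mu + sigma * eps  # deterministic and differentiable in mu, sigma


def mc_marginal_likelihood(x, mu, sigma, tau, num_samples, rng):
    """Estimate p(x) = E_eps[p(x | z = mu + sigma * eps)] by plain averaging."""
    total = 0.0
    for _ in range(num_samples):
        z = sample_non_centered(mu, sigma, rng)
        total += normal_pdf(x, z, tau)  # toy likelihood (an assumption)
    return total / num_samples


rng = random.Random(0)
x, mu, sigma, tau = 0.5, 0.0, 1.0, 0.5  # illustrative values only
estimate = mc_marginal_likelihood(x, mu, sigma, tau, 20000, rng)
exact = normal_pdf(x, mu, math.sqrt(sigma ** 2 + tau ** 2))
print(f"MC estimate: {estimate:.4f}, closed form: {exact:.4f}")
```

Both sampling functions produce draws from the same distribution; the difference is that in the non-centered version the randomness enters through `eps`, which does not depend on `mu` or `sigma`, so the estimate is a smooth function of those parameters for fixed noise.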
