Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

Casey Chu; Jose Blanchet; Peter Glynn

arxiv: 1901.10691 · v2 · pith:N7V5SPR2new · submitted 2019-01-30 · 💻 cs.LG · stat.ML

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

Casey Chu , Jose Blanchet , Peter Glynn This is my paper

classification 💻 cs.LG stat.ML

keywords learningprobabilityalgorithmdescentfunctionalinferencemethodsreinforcement

0 comments

read the original abstract

This paper provides a unifying view of a wide range of problems of interest in machine learning by framing them as the minimization of functionals defined on the space of probability measures. In particular, we show that generative adversarial networks, variational inference, and actor-critic methods in reinforcement learning can all be seen through the lens of our framework. We then discuss a generic optimization algorithm for our formulation, called probability functional descent (PFD), and show how this algorithm recovers existing methods developed independently in the settings mentioned earlier.

This paper has not been read by Pith yet.

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

discussion (0)