
arxiv: 1711.05560 · v1 · pith:KYD5COEDnew · submitted 2017-11-15 · 📊 stat.ML · cs.LG

Variational Adaptive-Newton Method for Explorative Learning

classification: stat.ML, cs.LG
keywords: learning, method, methods, optimization, variational, active, continuous, explorative-learning

We present the Variational Adaptive Newton (VAN) method, a black-box optimization method especially suited to explorative-learning tasks such as active learning and reinforcement learning. Like Bayesian methods, VAN estimates a distribution that can be used for exploration, but its computations are similar to those of continuous optimization methods. Our theoretical contribution shows that VAN is a second-order method that unifies existing methods in the distinct fields of continuous optimization, variational inference, and evolution strategies. Our experimental results show that VAN performs well on a wide variety of learning tasks. This work presents a general-purpose explorative-learning method that has the potential to improve learning in areas such as active learning and reinforcement learning.
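
To make the idea in the abstract more concrete, here is a minimal sketch of a Newton-like update on a Gaussian search distribution: the mean and precision are updated from gradients and Hessians averaged over samples drawn from the current distribution, so the distribution drives exploration while the update resembles a second-order optimizer. The exact update form, the toy quadratic objective, the step size, and every name below (`van_style_step`, `run_demo`, `A`, `c`) are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def van_style_step(mu, P, grad, hess, rng, n_samples=10, step=1.0):
    """One Newton-like update of a Gaussian N(mu, P^{-1}).

    Assumed update (illustrative, not quoted from the paper):
        P_new  = P + step * E_q[hess(theta)]
        mu_new = mu - step * P_new^{-1} E_q[grad(theta)]
    with both expectations estimated by Monte Carlo sampling from q.
    Assumes the averaged Hessian is positive semi-definite, so P stays
    positive definite.
    """
    Sigma = np.linalg.inv(P)
    Sigma = 0.5 * (Sigma + Sigma.T)  # symmetrize against round-off
    thetas = rng.multivariate_normal(mu, Sigma, size=n_samples)
    g = np.mean([grad(t) for t in thetas], axis=0)
    H = np.mean([hess(t) for t in thetas], axis=0)
    P_new = P + step * H
    mu_new = mu - step * np.linalg.solve(P_new, g)
    return mu_new, P_new


def run_demo(n_iters=200, seed=0):
    """Run the sketch on a toy convex quadratic with minimizer c."""
    rng = np.random.default_rng(seed)
    A = np.array([[3.0, 1.0], [1.0, 2.0]])  # hypothetical curvature matrix
    c = np.array([1.0, -2.0])               # hypothetical minimizer

    def grad(theta):
        # Gradient of f(theta) = 0.5 * (theta - c)^T A (theta - c)
        return A @ (theta - c)

    def hess(theta):
        # Hessian of the quadratic is constant
        return A

    mu = np.zeros(2)  # initial mean
    P = np.eye(2)     # initial precision (inverse covariance)
    for _ in range(n_iters):
        mu, P = van_style_step(mu, P, grad, hess, rng)
    return mu, P
```

Calling `run_demo()` returns the final mean and precision. On this toy problem the precision grows with each step, so the sampling distribution narrows around the minimizer as evidence about the curvature accumulates, which is one way to read the exploration behavior the abstract describes.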

