Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

A. Taylan Cemgil; \c{C}a\u{g}atay Y{\i}ld{\i}z; Ga\"el Richard; Thanh Huy Nguyen; Umut \c{S}im\c{s}ekli

arxiv: 1806.02617 · v1 · pith:GP3P6AEXnew · submitted 2018-06-07 · 📊 stat.ML · cs.LG

Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

Umut \c{S}im\c{s}ekli , \c{C}a\u{g}atay Y{\i}ld{\i}z , Thanh Huy Nguyen , Ga\"el Richard , A. Taylan Cemgil This is my paper

classification 📊 stat.ML cs.LG

keywords algorithmproposednon-convexoptimizationstochasticcertainconditionsconvergence

0 comments

read the original abstract

Recent studies have illustrated that stochastic gradient Markov Chain Monte Carlo techniques have a strong potential in non-convex optimization, where local and global convergence guarantees can be shown under certain conditions. By building up on this recent theory, in this study, we develop an asynchronous-parallel stochastic L-BFGS algorithm for non-convex optimization. The proposed algorithm is suitable for both distributed and shared-memory settings. We provide formal theoretical analysis and show that the proposed method achieves an ergodic convergence rate of ${\cal O}(1/\sqrt{N})$ ($N$ being the total number of iterations) and it can achieve a linear speedup under certain conditions. We perform several experiments on both synthetic and real datasets. The results support our theory and show that the proposed algorithm provides a significant speedup over the recently proposed synchronous distributed L-BFGS algorithm.

This paper has not been read by Pith yet.

Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

discussion (0)