pith. sign in

arxiv: 1807.05328 · v1 · pith:FBPIZ3HDnew · submitted 2018-07-14 · 💻 cs.LG · math.OC· stat.ML

On the Acceleration of L-BFGS with Second-Order Information and Stochastic Batches

classification 💻 cs.LG math.OCstat.ML
keywords batchesl-bfgsstochasticaccelerationinformationsecond-orderachievingaddition
0
0 comments X
read the original abstract

This paper proposes a framework of L-BFGS based on the (approximate) second-order information with stochastic batches, as a novel approach to the finite-sum minimization problems. Different from the classical L-BFGS where stochastic batches lead to instability, we use a smooth estimate for the evaluations of the gradient differences while achieving acceleration by well-scaling the initial Hessians. We provide theoretical analyses for both convex and nonconvex cases. In addition, we demonstrate that within the popular applications of least-square and cross-entropy losses, the algorithm admits a simple implementation in the distributed environment. Numerical experiments support the efficiency of our algorithms.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.