Stochastic Cubic Regularization for Fast Nonconvex Optimization

Chi Jin; Jeffrey Regier; Michael I. Jordan; Mitchell Stern; Nilesh Tripuraneni

arxiv: 1711.02838 · v2 · pith:APCY5ZPEnew · submitted 2017-11-08 · 💻 cs.LG · math.OC· stat.ML

Stochastic Cubic Regularization for Fast Nonconvex Optimization

Nilesh Tripuraneni , Mitchell Stern , Chi Jin , Jeffrey Regier , Michael I. Jordan This is my paper

classification 💻 cs.LG math.OCstat.ML

keywords stochasticefficientlyepsilongradientlocalmathcalminimanonconvex

0 comments

read the original abstract

This paper proposes a stochastic variant of a classic algorithm---the cubic-regularized Newton method [Nesterov and Polyak 2006]. The proposed algorithm efficiently escapes saddle points and finds approximate local minima for general smooth, nonconvex functions in only $\mathcal{\tilde{O}}(\epsilon^{-3.5})$ stochastic gradient and stochastic Hessian-vector product evaluations. The latter can be computed as efficiently as stochastic gradients. This improves upon the $\mathcal{\tilde{O}}(\epsilon^{-4})$ rate of stochastic gradient descent. Our rate matches the best-known result for finding local minima without requiring any delicate acceleration or variance-reduction techniques.

This paper has not been read by Pith yet.

Stochastic Cubic Regularization for Fast Nonconvex Optimization

discussion (0)