Accelerated Methods for Non-Convex Optimization

Aaron Sidford; John C. Duchi; Oliver Hinder; Yair Carmon

arxiv: 1611.00756 · v2 · pith:4G3DP3LUnew · submitted 2016-11-02 · 🧮 math.OC · cs.DS

Accelerated Methods for Non-Convex Optimization

Yair Carmon , John C. Duchi , Oliver Hinder , Aaron Sidford This is my paper

classification 🧮 math.OC cs.DS

keywords epsilonmethodgradientacceleratednablanon-convexoptimizationpoint

0 comments

read the original abstract

We present an accelerated gradient method for non-convex optimization problems with Lipschitz continuous first and second derivatives. The method requires time $O(\epsilon^{-7/4} \log(1/ \epsilon) )$ to find an $\epsilon$-stationary point, meaning a point $x$ such that $\|\nabla f(x)\| \le \epsilon$. The method improves upon the $O(\epsilon^{-2} )$ complexity of gradient descent and provides the additional second-order guarantee that $\nabla^2 f(x) \succeq -O(\epsilon^{1/2})I$ for the computed $x$. Furthermore, our method is Hessian free, i.e. it only requires gradient computations, and is therefore suitable for large scale applications.

This paper has not been read by Pith yet.

Accelerated Methods for Non-Convex Optimization

discussion (0)