First-order Methods Almost Always Avoid Saddle Points

Benjamin Recht; Georgios Piliouras; Ioannis Panageas; Jason D. Lee; Max Simchowitz; Michael I. Jordan

arxiv: 1710.07406 · v1 · pith:BTZLZMD7new · submitted 2017-10-20 · 📊 stat.ML · cs.LG· math.OC

First-order Methods Almost Always Avoid Saddle Points

Jason D. Lee , Ioannis Panageas , Georgios Piliouras , Max Simchowitz , Michael I. Jordan , Benjamin Recht This is my paper

classification 📊 stat.ML cs.LGmath.OC

keywords avoiddescentfirst-ordermethodspointssaddlealmostaccess

0 comments

read the original abstract

We establish that first-order methods avoid saddle points for almost all initializations. Our results apply to a wide variety of first-order methods, including gradient descent, block coordinate descent, mirror descent and variants thereof. The connecting thread is that such algorithms can be studied from a dynamical systems perspective in which appropriate instantiations of the Stable Manifold Theorem allow for a global stability analysis. Thus, neither access to second-order derivative information nor randomness beyond initialization is necessary to provably avoid saddle points.

This paper has not been read by Pith yet.

First-order Methods Almost Always Avoid Saddle Points

discussion (0)