
arXiv: 1905.07436 · v1 · submitted 2019-05-17 · math.OC · cs.LG · cs.SY · stat.ML

A Dynamical Systems Perspective on Nesterov Acceleration

keywords: acceleration, nesterov, differential, dynamical, equation, phenomenon, accelerated, analysis

We present a dynamical systems framework for understanding Nesterov's accelerated gradient method. In contrast to earlier work, our derivation does not rely on a vanishing step size argument. We show that Nesterov acceleration arises from discretizing an ordinary differential equation with a semi-implicit Euler integration scheme. We analyze both the underlying differential equation and its discretization to obtain insights into the phenomenon of acceleration. The analysis suggests that a curvature-dependent damping term lies at the heart of the phenomenon. We further establish connections between the discretized and the continuous-time dynamics.
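As a rough illustration of the structure the abstract refers to, the sketch below writes the textbook constant-momentum form of Nesterov's method as a position/velocity update: the velocity is updated first, using the gradient at a look-ahead point, and the position is then advanced with the new velocity, which is the ordering a semi-implicit Euler step uses. This is not the paper's derivation or its differential equation. The diagonal quadratic test problem, the step size 1/L, the momentum coefficient, and all function names are assumptions chosen for illustration.

```python
import math


def grad(x, eigs):
    # Gradient of the assumed test function f(x) = 0.5 * sum(eigs[i] * x[i]**2).
    return [lam * xi for lam, xi in zip(eigs, x)]


def norm(x):
    return math.sqrt(sum(xi * xi for xi in x))


def nesterov(x0, eigs, steps):
    # Textbook constant-momentum Nesterov method for a strongly convex
    # quadratic, written in position/velocity form (an illustrative choice).
    big_l, mu = max(eigs), min(eigs)
    s = 1.0 / big_l                       # step size 1/L (assumed)
    root_kappa = math.sqrt(big_l / mu)
    beta = (root_kappa - 1.0) / (root_kappa + 1.0)
    x = list(x0)
    v = [0.0] * len(x)                    # start at rest
    for _ in range(steps):
        # Gradient is evaluated at the look-ahead point x + beta * v.
        look = [xi + beta * vi for xi, vi in zip(x, v)]
        g = grad(look, eigs)
        # Velocity first, then position with the NEW velocity.
        v = [beta * vi - s * gi for vi, gi in zip(v, g)]
        x = [xi + vi for xi, vi in zip(x, v)]
    return x


def gradient_descent(x0, eigs, steps):
    # Plain gradient descent with the same step size, for comparison.
    s = 1.0 / max(eigs)
    x = list(x0)
    for _ in range(steps):
        g = grad(x, eigs)
        x = [xi - s * gi for xi, gi in zip(x, g)]
    return x


if __name__ == "__main__":
    eigs = [1.0, 100.0]                   # condition number 100
    x0 = [1.0, 1.0]
    print("nesterov:", norm(nesterov(x0, eigs, 100)))
    print("gradient descent:", norm(gradient_descent(x0, eigs, 100)))
```

On this ill-conditioned example the accelerated iterate ends up much closer to the minimizer at the origin than plain gradient descent after the same number of steps. Evaluating the gradient at `x + beta * v` rather than at `x` is what distinguishes this update from heavy-ball momentum.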
