pith. machine review for the scientific record. sign in

arxiv: 1711.05224 · v3 · submitted 2017-11-14 · 🧮 math.OC

Recognition: unknown

Revisiting Normalized Gradient Descent: Fast Evasion of Saddle Points

Authors on Pith no claims yet
classification 🧮 math.OC
keywords saddledescentpointsgradientproblemsarbitrarilyescapekappa
0
0 comments X
read the original abstract

The note considers normalized gradient descent (NGD), a natural modification of classical gradient descent (GD) in optimization problems. A serious shortcoming of GD in non-convex problems is that GD may take arbitrarily long to escape from the neighborhood of a saddle point. This issue can make the convergence of GD arbitrarily slow, particularly in high-dimensional non-convex problems where the relative number of saddle points is often large. The paper focuses on continuous-time descent. It is shown that, contrary to standard GD, NGD escapes saddle points `quickly.' In particular, it is shown that (i) NGD `almost never' converges to saddle points and (ii) the time required for NGD to escape from a ball of radius $r$ about a saddle point $x^*$ is at most $5\sqrt{\kappa}r$, where $\kappa$ is the condition number of the Hessian of $f$ at $x^*$. As an application of this result, a global convergence-time bound is established for NGD under mild assumptions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Principled Design of Diffusion-based Optimizers for Inverse Problems

    cs.CV 2026-05 unverdicted novelty 5.0

    Reparameterizations create invariances in diffusion inverse-problem solvers, enabling hyperparameter reuse and accelerated inference via the OptDiff optimization framework.