Tight analyses for non-smooth stochastic gradient descent

Nicholas JA Harvey, Christopher Liaw, Yaniv Plan, Sikander Randhawa · 2019

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Gradient Descent's Last Iterate is Often (slightly) Suboptimal

math.OC · 2026-04-15 · unverdicted · novelty 8.0

Proves it is impossible to achieve optimal last-iterate rates for GD and SGD without knowing the horizon T in advance, incurring an unavoidable poly-log factor penalty even in the deterministic case.

Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size

cs.LG · 2026-04-10 · unverdicted · novelty 6.0

SGD with greedy step size on smooth quadratics in the interpolation regime attains O(1/t^{3/4}) last-iterate convergence.

citing papers explorer

Showing 2 of 2 citing papers.

Gradient Descent's Last Iterate is Often (slightly) Suboptimal math.OC · 2026-04-15 · unverdicted · none · ref 10
Proves it is impossible to achieve optimal last-iterate rates for GD and SGD without knowing the horizon T in advance, incurring an unavoidable poly-log factor penalty even in the deterministic case.
Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size cs.LG · 2026-04-10 · unverdicted · none · ref 17
SGD with greedy step size on smooth quadratics in the interpolation regime attains O(1/t^{3/4}) last-iterate convergence.

Tight analyses for non-smooth stochastic gradient descent

fields

years

verdicts

representative citing papers

citing papers explorer