Step-Size Stability in Stochastic Optimization: A Theoretical Perspective

Adrien Taylor; Fabian Schaipp; Robert M. Gower

arxiv: 2602.09842 · v2 · pith:ZQNPQL23new · submitted 2026-02-10 · 🧮 math.OC · cs.LG

Step-Size Stability in Stochastic Optimization: A Theoretical Perspective

Fabian Schaipp , Robert M. Gower , Adrien Taylor This is my paper

classification 🧮 math.OC cs.LG

keywords theoreticalmethodssizestepadaptiveanalysisboundmethod

0 comments

read the original abstract

We present a theoretical analysis of stochastic optimization methods in terms of their sensitivity with respect to the step size. We identify a key quantity that, for each method, describes how the performance degrades as the step size becomes too large. For convex problems, we show that this quantity directly impacts the suboptimality bound of the method. Most importantly, our analysis provides direct theoretical evidence that adaptive step-size methods, such as SPS or NGN, are more robust than SGD. This allows us to quantify the advantage of these adaptive methods beyond empirical evaluation. Finally, we show through experiments that our theoretical bound qualitatively mirrors the actual performance as a function of the step size, even for non-convex problems.

This paper has not been read by Pith yet.

Step-Size Stability in Stochastic Optimization: A Theoretical Perspective

discussion (0)