pith. sign in

arxiv: 1710.09430 · v2 · pith:KCNVS6PDnew · submitted 2017-10-25 · 📊 stat.ML · cs.LG· math.OC

A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares)

classification 📊 stat.ML cs.LGmath.OC
keywords optimalitystochasticcharacterizingdescentgradientleastminimaxprocess
0
0 comments X
read the original abstract

This work provides a simplified proof of the statistical minimax optimality of (iterate averaged) stochastic gradient descent (SGD), for the special case of least squares. This result is obtained by analyzing SGD as a stochastic process and by sharply characterizing the stationary covariance matrix of this process. The finite rate optimality characterization captures the constant factors and addresses model mis-specification.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.