pith. machine review for the scientific record. sign in

arxiv: 1904.02130 · v1 · pith:D5ZPEWGMnew · submitted 2019-04-03 · 🧮 math.ST · math.OC· math.PR· stat.ML· stat.TH

Normal Approximation for Stochastic Gradient Descent via Non-Asymptotic Rates of Martingale CLT

classification 🧮 math.ST math.OCmath.PRstat.MLstat.TH
keywords non-asymptoticmartingaleratesnormalaveragedconvergencedescentgradient
0
0 comments X
read the original abstract

We provide non-asymptotic convergence rates of the Polyak-Ruppert averaged stochastic gradient descent (SGD) to a normal random vector for a class of twice-differentiable test functions. A crucial intermediate step is proving a non-asymptotic martingale central limit theorem (CLT), i.e., establishing the rates of convergence of a multivariate martingale difference sequence to a normal random vector, which might be of independent interest. We obtain the explicit rates for the multivariate martingale CLT using a combination of Stein's method and Lindeberg's argument, which is then used in conjunction with a non-asymptotic analysis of averaged SGD proposed in [PJ92]. Our results have potentially interesting consequences for computing confidence intervals for parameter estimation with SGD and constructing hypothesis tests with SGD that are valid in a non-asymptotic sense.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.