Gaussian Approximation and Multiplier Bootstrap for Stochastic Gradient Descent

· 2025 · stat.ML · arXiv 2502.06719

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open full Pith review browse 5 citing papers arXiv PDF

abstract

In this paper, we establish the non-asymptotic validity of the multiplier bootstrap procedure for constructing the confidence sets using the Stochastic Gradient Descent (SGD) algorithm. Under appropriate regularity conditions, our approach avoids the need to approximate the limiting covariance of Polyak-Ruppert SGD iterates, which allows us to derive approximation rates in convex distance of order up to $1/\sqrt{n}$. Notably, this rate can be faster than the one that can be proven in the Polyak-Juditsky central limit theorem. To our knowledge, this provides the first fully non-asymptotic bound on the accuracy of bootstrap approximations in SGD algorithms. Our analysis builds on the Gaussian approximation results for nonlinear statistics of independent random variables.

representative citing papers

Gaussian Approximation and Multiplier Bootstrap for Federated Linear Stochastic Approximation

stat.ML · 2026-05-19 · unverdicted · novelty 7.0

Establishes non-asymptotic Gaussian approximation bounds for federated LSA with explicit communication-heterogeneity trade-offs and introduces an online multiplier bootstrap for last-iterate inference with validity guarantees.

When Does Dynamic Preconditioning Preserve the Polyak-Ruppert CLT? A Stabilization Threshold

math.ST · 2026-04-26 · unverdicted · novelty 7.0

Dynamic preconditioning preserves the Polyak-Ruppert CLT for averaged SGD if the preconditioner stabilizes at rate β > (α + 1)/2.

Gaussian Approximation for Asynchronous Q-learning

stat.ML · 2026-04-08 · unverdicted · novelty 7.0

Derived rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.

Refining Covariance Matrix Estimation in Stochastic Gradient Descent Through Bias Reduction

stat.ML · 2026-04-23 · unverdicted · novelty 6.0

A novel bias-reduced online covariance estimator for SGD achieves convergence rate n to the power (α-1)/2 times square root of log n without second-order derivatives.

On Gaussian approximation for entropy-regularized Q-learning with function approximation

stat.ML · 2026-05-17 · unverdicted · novelty 5.0

Establishes n^{-1/4} Gaussian approximation in convex distance for averaged entropy-regularized Q-learning with linear function approximation and polynomial stepsizes.

citing papers explorer

Showing 5 of 5 citing papers.

Gaussian Approximation and Multiplier Bootstrap for Federated Linear Stochastic Approximation stat.ML · 2026-05-19 · unverdicted · none · ref 26 · internal anchor
Establishes non-asymptotic Gaussian approximation bounds for federated LSA with explicit communication-heterogeneity trade-offs and introduces an online multiplier bootstrap for last-iterate inference with validity guarantees.
When Does Dynamic Preconditioning Preserve the Polyak-Ruppert CLT? A Stabilization Threshold math.ST · 2026-04-26 · unverdicted · none · ref 37 · internal anchor
Dynamic preconditioning preserves the Polyak-Ruppert CLT for averaged SGD if the preconditioner stabilizes at rate β > (α + 1)/2.
Gaussian Approximation for Asynchronous Q-learning stat.ML · 2026-04-08 · unverdicted · none · ref 45 · internal anchor
Derived rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.
Refining Covariance Matrix Estimation in Stochastic Gradient Descent Through Bias Reduction stat.ML · 2026-04-23 · unverdicted · none · ref 6 · internal anchor
A novel bias-reduced online covariance estimator for SGD achieves convergence rate n to the power (α-1)/2 times square root of log n without second-order derivatives.
On Gaussian approximation for entropy-regularized Q-learning with function approximation stat.ML · 2026-05-17 · unverdicted · none · ref 30 · internal anchor
Establishes n^{-1/4} Gaussian approximation in convex distance for averaged entropy-regularized Q-learning with linear function approximation and polynomial stepsizes.

Gaussian Approximation and Multiplier Bootstrap for Stochastic Gradient Descent

fields

years

verdicts

representative citing papers

citing papers explorer