1909.02712 , archivePrefix=

Jiaqi Zhang, Keyou You · 1909 · arXiv 1909.02712

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Unveiling High-Probability Generalization in Decentralized SGD

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

High-probability generalization bounds for D-SGD are derived at the optimal rate O(1/sqrt(mn) log(1/δ)) via pointwise uniform stability across convex and non-convex settings.

Clipped Stochastic Gradient Tracking For Locally Smooth Functions

math.OC · 2026-05-16 · unverdicted · novelty 6.0

The authors derive a clipped gradient tracking method with staggered variance reduction for RUC-regular finite-sum distributed optimization problems, establishing an O(∑ n_i^{1.5} + n_i^{0.5} ε^{-1}) complexity bound that relies only on local smoothness.

Stability and Generalization for Decentralized Markov SGD

cs.LG · 2026-05-03 · unverdicted · novelty 6.0

Decentralized SGD and SGDA under Markovian sampling admit non-asymptotic generalization bounds that incorporate network topology, Markov mixing rates, and primal-dual dynamics.

citing papers explorer

Showing 3 of 3 citing papers.

Unveiling High-Probability Generalization in Decentralized SGD cs.LG · 2026-05-11 · unverdicted · none · ref 45
High-probability generalization bounds for D-SGD are derived at the optimal rate O(1/sqrt(mn) log(1/δ)) via pointwise uniform stability across convex and non-convex settings.
Clipped Stochastic Gradient Tracking For Locally Smooth Functions math.OC · 2026-05-16 · unverdicted · none · ref 51
The authors derive a clipped gradient tracking method with staggered variance reduction for RUC-regular finite-sum distributed optimization problems, establishing an O(∑ n_i^{1.5} + n_i^{0.5} ε^{-1}) complexity bound that relies only on local smoothness.
Stability and Generalization for Decentralized Markov SGD cs.LG · 2026-05-03 · unverdicted · none · ref 65
Decentralized SGD and SGDA under Markovian sampling admit non-asymptotic generalization bounds that incorporate network topology, Markov mixing rates, and primal-dual dynamics.

1909.02712 , archivePrefix=

fields

years

verdicts

representative citing papers

citing papers explorer