Distributed stochastic optimization with gradient tracking over strongly-connected networks

Anit Kumar Sahu; Ran Xin; Soummya Kar; Usman A. Khan

arxiv: 1903.07266 · v2 · pith:6EXVHJ3Snew · submitted 2019-03-18 · 💻 cs.LG · cs.DC· cs.MA· cs.SY· stat.ML

Distributed stochastic optimization with gradient tracking over strongly-connected networks

Ran Xin , Anit Kumar Sahu , Usman A. Khan , Soummya Kar This is my paper

classification 💻 cs.LG cs.DCcs.MAcs.SYstat.ML

keywords mathcaldistributedstochasticstrongly-connectedagentcostglobalgradient

0 comments

read the original abstract

In this paper, we study distributed stochastic optimization to minimize a sum of smooth and strongly-convex local cost functions over a network of agents, communicating over a strongly-connected graph. Assuming that each agent has access to a stochastic first-order oracle ($\mathcal{SFO}$), we propose a novel distributed method, called $\mathcal{S}$-$\mathcal{AB}$, where each agent uses an auxiliary variable to asymptotically track the gradient of the global cost in expectation. The $\mathcal{S}$-$\mathcal{AB}$ algorithm employs row- and column-stochastic weights simultaneously to ensure both consensus and optimality. Since doubly-stochastic weights are not used, $\mathcal{S}$-$\mathcal{AB}$ is applicable to arbitrary strongly-connected graphs. We show that under a sufficiently small constant step-size, $\mathcal{S}$-$\mathcal{AB}$ converges linearly (in expected mean-square sense) to a neighborhood of the global minimizer. We present numerical simulations based on real-world data sets to illustrate the theoretical results.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Distributed Learning in Non-Convex Environments -- Part II: Polynomial Escape from Saddle-Points
cs.MA 2019-07 unverdicted novelty 6.0

Diffusion strategy for distributed learning escapes saddle points in O(1/μ) iterations and returns approximate second-order stationary points in polynomial iterations with less restrictive noise assumptions than centr...
Distributed Learning in Non-Convex Environments -- Part I: Agreement at a Linear Rate
math.OC 2019-07 unverdicted novelty 5.0

Diffusion learning achieves linear-rate agreement around the network centroid in stochastic non-convex distributed optimization.