Convergence rates for distributed stochastic optimization over random networks

arxiv: 1803.07836 · v1 · pith:JCABYPN5new · submitted 2018-03-21 · 🧮 math.OC

Convergence rates for distributed stochastic optimization over random networks

Dusan Jakovetic , Dragana Bajovic , Anit Kumar Sahu , Soummya Kar This is my paper

classification 🧮 math.OC

keywords distributedgradientconvergencestochasticrandomrateaverageconsensus

0 comments p. Extension

pith:JCABYPN5 Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{JCABYPN5}

Prints a linked pith:JCABYPN5 badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We establish the O($\frac{1}{k}$) convergence rate for distributed stochastic gradient methods that operate over strongly convex costs and random networks. The considered class of methods is standard each node performs a weighted average of its own and its neighbors solution estimates (consensus), and takes a negative step with respect to a noisy version of its local functions gradient (innovation). The underlying communication network is modeled through a sequence of temporally independent identically distributed (i.i.d.) Laplacian matrices connected on average, while the local gradient noises are also i.i.d. in time, have finite second moment, and possibly unbounded support. We show that, after a careful setting of the consensus and innovations potentials (weights), the distributed stochastic gradient method achieves a (order-optimal) O($\frac{1}{k}$) convergence rate in the mean square distance from the solution. This is the first order-optimal convergence rate result on distributed strongly convex stochastic optimization when the network is random and/or the gradient noises have unbounded support. Simulation examples confirm the theoretical findings.

This paper has not been read by Pith yet.

Convergence rates for distributed stochastic optimization over random networks

discussion (0)