Communication-Efficient Distributed Strongly Convex Stochastic Optimization: Non-Asymptotic Rates

Anit Kumar Sahu; Dragana Bajovic; Dusan Jakovetic; Soummya Kar

arxiv: 1809.02920 · v1 · pith:5BVHYCFInew · submitted 2018-09-09 · 🧮 math.OC

Communication-Efficient Distributed Strongly Convex Stochastic Optimization: Non-Asymptotic Rates

Anit Kumar Sahu , Dusan Jakovetic , Dragana Bajovic , Soummya Kar This is my paper

classification 🧮 math.OC

keywords optimizationordermathrmfirstzerothcostratescomm

0 comments

read the original abstract

We examine fundamental tradeoffs in iterative distributed zeroth and first order stochastic optimization in multi-agent networks in terms of \emph{communication cost} (number of per-node transmissions) and \emph{computational cost}, measured by the number of per-node noisy function (respectively, gradient) evaluations with zeroth order (respectively, first order) methods. Specifically, we develop novel distributed stochastic optimization methods for zeroth and first order strongly convex optimization by utilizing a probabilistic inter-agent communication protocol that increasingly sparsifies communications among agents as time progresses. Under standard assumptions on the cost functions and the noise statistics, we establish with the proposed method the $O(1/(C_{\mathrm{comm}})^{4/3-\zeta})$ and $O(1/(C_{\mathrm{comm}})^{8/9-\zeta})$ mean square error convergence rates, for the first and zeroth order optimization, respectively, where $C_{\mathrm{comm}}$ is the expected number of network communications and $\zeta>0$ is arbitrarily small. The methods are shown to achieve order-optimal convergence rates in terms of computational cost~$C_{\mathrm{comp}}$, $O(1/C_{\mathrm{comp}})$ (first order optimization) and $O(1/(C_{\mathrm{comp}})^{2/3})$ (zeroth order optimization), while achieving the order-optimal convergence rates in terms of iterations. Experiments on real-life datasets illustrate the efficacy of the proposed algorithms.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Variance-Reduced Gradient Estimator for Nonconvex Zeroth-Order Distributed Optimization
math.OC 2024-09 unverdicted novelty 7.0

Introduces a novel variance-reduced gradient estimator using Bernoulli distribution for single-direction renovation or full correction, integrated with gradient tracking, achieving oracle complexities O(d/ε) for smoot...