A Unified Coding Framework for Distributed Computing with Straggling Servers
read the original abstract
We propose a unified coded framework for distributed computing with straggling servers, by introducing a tradeoff between "latency of computation" and "load of communication" for some linear computation tasks. We show that the coded scheme of [1]-[3] that repeats the intermediate computations to create coded multicasting opportunities to reduce communication load, and the coded scheme of [4], [5] that generates redundant intermediate computations to combat against straggling servers can be viewed as special instances of the proposed framework, by considering two extremes of this tradeoff: minimizing either the load of communication or the latency of computation individually. Furthermore, the latency-load tradeoff achieved by the proposed coded framework allows to systematically operate at any point on that tradeoff to perform distributed computing tasks. We also prove an information-theoretic lower bound on the latency-load tradeoff, which is shown to be within a constant multiplicative gap from the achieved tradeoff at the two end points.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Coded Distributed Computing: Performance Limits and Code Designs
Coded distributed computing execution time equals erasure-channel error probability for linear codes, with explicit expressions for binary random linear codes and asymptotic optimality for binary codes matching any li...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.