Universally Decodable Matrices for Distributed Matrix-Vector Multiplication
read the original abstract
Coded computation is an emerging research area that leverages concepts from erasure coding to mitigate the effect of stragglers (slow nodes) in distributed computation clusters, especially for matrix computation problems. In this work, we present a class of distributed matrix-vector multiplication schemes that are based on codes in the Rosenbloom-Tsfasman metric and universally decodable matrices. Our schemes take into account the inherent computation order within a worker node. In particular, they allow us to effectively leverage partial computations performed by stragglers (a feature that many prior works lack). An additional main contribution of our work is a companion matrix-based embedding of these codes that allows us to obtain sparse and numerically stable schemes for the problem at hand. Experimental results confirm the effectiveness of our techniques.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Random Khatri-Rao-Product Codes for Numerically-Stable Distributed Matrix Multiplication
RKRP codes are MDS with probability 1, have identical communication/encoding costs to prior codes, lower average decoding complexity than OrthoPoly, and show substantially lower reconstruction error in numerical tests.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.