pith. sign in

arxiv: 1805.05278 · v1 · pith:HDDVL2G7new · submitted 2018-05-14 · 💻 cs.DC

A 3D Parallel Algorithm for QR Decomposition

classification 💻 cs.DC
keywords algorithmcommunicationcostbandwidthlatencyparallelcomputationscomputing
0
0 comments X
read the original abstract

Interprocessor communication often dominates the runtime of large matrix computations. We present a parallel algorithm for computing QR decompositions whose bandwidth cost (communication volume) can be decreased at the cost of increasing its latency cost (number of messages). By varying a parameter to navigate the bandwidth/latency tradeoff, we can tune this algorithm for machines with different communication costs.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.