arXiv: 0806.2159 · v3 · submitted 2008-06-12 · cs.NA

Communication-optimal parallel and sequential QR and LU factorizations: theory and practice

keywords: block, factors, algorithm, cyclic, factorization, layout, matrices, parallel

We present parallel and sequential dense QR factorization algorithms that are both optimal (up to polylogarithmic factors) in the amount of communication they perform, and just as stable as Householder QR. Our first algorithm, Tall Skinny QR (TSQR), factors m-by-n matrices in a one-dimensional (1-D) block cyclic row layout, and is optimized for m >> n. Our second algorithm, CAQR (Communication-Avoiding QR), factors general rectangular matrices distributed in a two-dimensional block cyclic layout. It invokes TSQR for each block column factorization.
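To make the TSQR idea concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation. It assumes contiguous row blocks rather than the block cyclic layout the abstract mentions, assumes a binary reduction tree as one possible way to combine the per-block results, and computes only the R factor. The function name `tsqr_r` and the parameter `num_blocks` are hypothetical.

```python
import numpy as np


def tsqr_r(A, num_blocks=4):
    """Return the R factor of a tall-skinny matrix A via a reduction tree.

    Illustrative sketch only: each row block stands in for the data held
    by one processor, and each block is assumed to have at least as many
    rows as A has columns.
    """
    # Split the rows into contiguous blocks (an assumption; the paper's
    # layout is block cyclic).
    blocks = np.array_split(A, num_blocks, axis=0)

    # Step 1: independent local QR on each block, keeping only R.
    rs = [np.linalg.qr(block, mode="r") for block in blocks]

    # Step 2: combine R factors pairwise until one remains. Each round
    # stacks two small n-by-n factors and refactors them, so only
    # n-by-n triangles would need to move between processors.
    while len(rs) > 1:
        merged = []
        for i in range(0, len(rs), 2):
            if i + 1 < len(rs):
                stacked = np.vstack([rs[i], rs[i + 1]])
                merged.append(np.linalg.qr(stacked, mode="r"))
            else:
                # Odd block out is carried to the next round unchanged.
                merged.append(rs[i])
        rs = merged

    return rs[0]
```

For a full-rank input, the result should match the R factor from a direct `np.linalg.qr(A, mode="r")` up to the signs of its rows, since both satisfy R^T R = A^T A.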
