Programming Parallel Dense Matrix Factorizations with Look-Ahead and OpenMP

Adri\'an Castell\'o; Enrique S. Quintana-Ort\'i; Francisco D. Igual; Rafael Rodr\'iguez-S\'anchez; Sandra Catal\'an

arxiv: 1804.07017 · v1 · pith:GGE5PETSnew · submitted 2018-04-19 · 💻 cs.DC · cs.MS

Programming Parallel Dense Matrix Factorizations with Look-Ahead and OpenMP

Sandra Catal\'an , Adri\'an Castell\'o , Francisco D. Igual , Rafael Rodr\'iguez-S\'anchez , Enrique S. Quintana-Ort\'i This is my paper

classification 💻 cs.DC cs.MS

keywords implementationperformancealgorithmsblasdensefactorizationhighlook-ahead

0 comments

read the original abstract

We investigate a parallelization strategy for dense matrix factorization (DMF) algorithms, using OpenMP, that departs from the legacy (or conventional) solution, which simply extracts concurrency from a multithreaded version of BLAS. This approach is also different from the more sophisticated runtime-assisted implementations, which decompose the operation into tasks and identify dependencies via directives and runtime support. Instead, our strategy attains high performance by explicitly embedding a static look-ahead technique into the DMF code, in order to overcome the performance bottleneck of the panel factorization, and realizing the trailing update via a cache-aware multi-threaded implementation of the BLAS. Although the parallel algorithms are specified with a highlevel of abstraction, the actual implementation can be easily derived from them, paving the road to deriving a high performance implementation of a considerable fraction of LAPACK functionality on any multicore platform with an OpenMP-like runtime.

This paper has not been read by Pith yet.

Programming Parallel Dense Matrix Factorizations with Look-Ahead and OpenMP

discussion (0)