pith. sign in

arxiv: 1509.00309 · v2 · pith:DCYV2XQBnew · submitted 2015-09-01 · 💻 cs.DC

Scalable Task-Based Algorithm for Multiplication of Block-Rank-Sparse Matrices

classification 💻 cs.DC
keywords matricesmultiplicationalgorithmformulationmatrixsummatask-basedblock-rank-sparse
0
0 comments X
read the original abstract

A task-based formulation of Scalable Universal Matrix Multiplication Algorithm (SUMMA), a popular algorithm for matrix multiplication (MM), is applied to the multiplication of hierarchy-free, rank-structured matrices that appear in the domain of quantum chemistry (QC). The novel features of our formulation are: (1) concurrent scheduling of multiple SUMMA iterations, and (2) fine-grained task-based composition. These features make it tolerant of the load imbalance due to the irregular matrix structure and eliminate all artifactual sources of global synchronization.Scalability of iterative computation of square-root inverse of block-rank-sparse QC matrices is demonstrated; for full-rank (dense) matrices the performance of our SUMMA formulation usually exceeds that of the state-of-the-art dense MM implementations (ScaLAPACK and Cyclops Tensor Framework).

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.