pith. sign in

arxiv: 1006.2183 · v1 · pith:TMVFVVTVnew · submitted 2010-06-11 · 💻 cs.DC · cs.MS· cs.NA· cs.PF

Highly Parallel Sparse Matrix-Matrix Multiplication

classification 💻 cs.DC cs.MScs.NAcs.PF
keywords algorithmssparsematrix-matrixmultiplicationparallelprocessorsachieveblock
0
0 comments X
read the original abstract

Generalized sparse matrix-matrix multiplication is a key primitive for many high performance graph algorithms as well as some linear solvers such as multigrid. We present the first parallel algorithms that achieve increasing speedups for an unbounded number of processors. Our algorithms are based on two-dimensional block distribution of sparse matrices where serial sections use a novel hypersparse kernel for scalability. We give a state-of-the-art MPI implementation of one of our algorithms. Our experiments show scaling up to thousands of processors on a variety of test scenarios.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A 2D Parallel Triangle Counting Algorithm for Distributed-Memory Architectures

    cs.DC 2019-07 unverdicted novelty 6.0

    Introduces a 2D cyclic decomposition triangle counting algorithm for distributed-memory systems achieving 3.24-7.22x relative speedup on 169 MPI ranks and 10.2x over prior distributed algorithms.