Efficient multicore-aware parallelization strategies for iterative stencil computations

Georg Hager; Gerhard Wellein; Jan Treibig

arxiv: 1004.1741 · v1 · pith:XI2LQX7Fnew · submitted 2010-04-10 · 💻 cs.PF · cs.DC

Efficient multicore-aware parallelization strategies for iterative stencil computations

Jan Treibig , Gerhard Wellein , Georg Hager This is my paper

classification 💻 cs.PF cs.DC

keywords blockingcomputationsefficientgauss-seideliterativemulticoreoptimizationsmoothers

0 comments

read the original abstract

Stencil computations consume a major part of runtime in many scientific simulation codes. As prototypes for this class of algorithms we consider the iterative Jacobi and Gauss-Seidel smoothers and aim at highly efficient parallel implementations for cache-based multicore architectures. Temporal cache blocking is a known advanced optimization technique, which can reduce the pressure on the memory bus significantly. We apply and refine this optimization for a recently presented temporal blocking strategy designed to explicitly utilize multicore characteristics. Especially for the case of Gauss-Seidel smoothers we show that simultaneous multi-threading (SMT) can yield substantial performance improvements for our optimized algorithm.

This paper has not been read by Pith yet.

Efficient multicore-aware parallelization strategies for iterative stencil computations

discussion (0)