Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory

Georg Hager; Gerhard Wellein; Markus Wittmann

arxiv: 0912.4506 · v1 · pith:RLTXGYZGnew · submitted 2009-12-22 · 💻 cs.PF · cs.DC

Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory

Markus Wittmann , Georg Hager , Gerhard Wellein This is my paper

classification 💻 cs.PF cs.DC

keywords blockingsharedtemporalcodesmemorymulticorestencilaccelerating

0 comments

read the original abstract

New algorithms and optimization techniques are needed to balance the accelerating trend towards bandwidth-starved multicore chips. It is well known that the performance of stencil codes can be improved by temporal blocking, lessening the pressure on the memory interface. We introduce a new pipelined approach that makes explicit use of shared caches in multicore environments and minimizes synchronization and boundary overhead. For clusters of shared-memory nodes we demonstrate how temporal blocking can be employed successfully in a hybrid shared/distributed-memory environment.

This paper has not been read by Pith yet.

Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory

discussion (0)