pith. sign in

arxiv: 1405.7487 · v1 · pith:5JREKQOSnew · submitted 2014-05-29 · 💻 cs.DC

Asynchronous Execution of the Fast Multipole Method Using Charm++

classification 💻 cs.DC
keywords asynchronouscharmfastapproachbulk-synchronouscodedataexecution
0
0 comments X
read the original abstract

Fast multipole methods (FMM) on distributed mem- ory have traditionally used a bulk-synchronous model of com- municating the local essential tree (LET) and overlapping it with computation of the local data. This could be perceived as an extreme case of data aggregation, where the whole LET is communicated at once. Charm++ allows a much finer control over the granularity of communication, and has a asynchronous execution model that fits well with the structure of our FMM code. Unlike previous work on asynchronous fast N-body methods such as ChaNGa and PEPC, the present work performs a direct comparison against the traditional bulk-synchronous approach and the asynchronous approach using Charm++. Furthermore, the serial performance of our FMM code is over an order of magnitude better than these previous codes, so it is much more challenging to hide the overhead of Charm++.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Simple Communication Scheme for Distributed Fast Multipole Methods

    cs.DC 2026-04 unverdicted novelty 4.0

    A simple MPI-based scheme for distributed uniform-tree FMMs achieves weak scaling to 3.2e10 points on 512 nodes while preserving shared-memory optimizations.