arxiv: 1809.10778 · v1 · pith:W64QWVNBnew · submitted 2018-09-27 · 💻 cs.DC

Performance of MPI sends of non-contiguous data

Victor Eijkhout This is my paper

classification 💻 cs.DC

keywords derivedmessagesperformancebufferbufferingcausescombinationcomparably

0 comments

read the original abstract

We present an experimental investigation of the performance of MPI derived datatypes. For messages up to the megabyte range most schemes perform comparably to each other and to manual copying into a regular send buffer. However, for large messages the internal buffering of MPI causes differences in efficiency. The optimal scheme is a combination of packing and derived types.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Routing-Based Continual Learning for Multimodal Large Language Models
cs.LG 2025-11 unverdicted novelty 6.0

Routing architecture for MLLMs enables continual learning with constant compute, matching multi-task learning performance and supporting cross-modal transfer.