PICO is a benchmarking framework for collective operations that decouples portable setup from platform execution, supplies reference MPI implementations, and shows default choices can be up to 5x slower with up to 44% end-to-end training time reductions in simulator replays.
Leonardo: A pan-european pre-exascale supercomputer for hpc and ai applications
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.DC 2verdicts
UNVERDICTED 2representative citing papers
This study empirically characterizes congestion responses in EDR/HDR/NDR InfiniBand, Cray Slingshot, and Ethernet fabrics under controlled steady and bursty collective communication patterns at multiple system scales.
citing papers explorer
-
PICO: Performance Insights for Collective Operations
PICO is a benchmarking framework for collective operations that decouples portable setup from platform execution, supplies reference MPI implementations, and shows default choices can be up to 5x slower with up to 44% end-to-end training time reductions in simulator replays.
-
Characterizing the Impact of Congestion in Modern HPC Interconnects
This study empirically characterizes congestion responses in EDR/HDR/NDR InfiniBand, Cray Slingshot, and Ethernet fabrics under controlled steady and bursty collective communication patterns at multiple system scales.