Charm++ techniques enable efficient overdecomposition on multi-vendor GPGPU distributed systems.
Fine-grained automated failure management for extreme-scale gpu accelerated systems
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
background 1
citation-polarity summary
fields
cs.DC 2years
2026 2verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
Aurora reached 1.01 EF/s FP64 HPL and 11.64 EF/s HPL-MxP through locality-aware mapping, CPU-GPU pipelining, mixed-precision orchestration, and hybrid resilience on a large Intel GPU-based system.
citing papers explorer
-
Efficient and Portable Support for Overdecomposition on Distributed Memory GPGPU Platforms
Charm++ techniques enable efficient overdecomposition on multi-vendor GPGPU distributed systems.
-
Sustaining Exascale Performance: Lessons from HPL and HPL-MxP on Aurora
Aurora reached 1.01 EF/s FP64 HPL and 11.64 EF/s HPL-MxP through locality-aware mapping, CPU-GPU pipelining, mixed-precision orchestration, and hybrid resilience on a large Intel GPU-based system.