SME-aware kernel and hybrid execution optimizations for SPECFEM3D on LX2 ARM processors deliver 4-6x speedup and shift the favorable (h,p) operating point to higher orders along the dispersion-based iso-accuracy frontier.
Optimizing computation-communication overlap in asynchronous task-based programs,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.DC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
High-Order Spectral Element Methods for Wave Propagation on ARM Multicore CPU with SME: Optimizations and Implications
SME-aware kernel and hybrid execution optimizations for SPECFEM3D on LX2 ARM processors deliver 4-6x speedup and shift the favorable (h,p) operating point to higher orders along the dispersion-based iso-accuracy frontier.