StreamGuard uses dynamic checkpointing and load rebalancing to cut failure impact by up to 6x in HPC data streams while adding under 1% overhead in normal runs.
Smart, Emanuele Danovaro, Tiago Quintino, Dean Hildebrand, and Adrian Jackson
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.DC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
StreamGuard: Low-Overhead Resilience for Real-time HPC Data Streams
StreamGuard uses dynamic checkpointing and load rebalancing to cut failure impact by up to 6x in HPC data streams while adding under 1% overhead in normal runs.