Detecting silent data corruptions in the wild
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years: 2026 (3)
verdicts: UNVERDICTED (3)
roles: background (1)
polarities: background (1)
representative citing papers
Cerberus unifies on-die, link, and system ECC layers via an Encode-Once Decode-Many architecture to improve resilience and cut redundant overhead.
AIReSim is a discrete event simulator for evaluating failure mitigation, recovery, and capacity planning decisions in large AI clusters.
citing papers explorer
- ITHICA: Intra-Thread Instruction Checking Approach for Defect-Induced Silent Data Corruptions
  ITHICA generates functional tests via intra-thread instruction duplication and comparison, detecting 39% more defective servers than baseline methods on over 3000 real CPUs while revealing new defect behaviors.
- Cerberus: Cross-Layer ECC Co-Design for Robust and Efficient Memory Protection
  Cerberus unifies on-die, link, and system ECC layers via an Encode-Once Decode-Many architecture to improve resilience and cut redundant overhead.
- AIReSim: A Discrete Event Simulator for Large-scale AI Cluster Reliability Modeling
  AIReSim is a discrete event simulator for evaluating failure mitigation, recovery, and capacity planning decisions in large AI clusters.