pith. sign in

arxiv: 1806.09679 · v1 · pith:N4TXNJS5new · submitted 2018-06-14 · 💻 cs.LG · cs.AR· stat.ML

On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation

classification 💻 cs.LG cs.ARstat.ML
keywords acceleratorscharacterizationdatafaultmitigationfaultshardwaremassive
0
0 comments X
read the original abstract

Machine Learning (ML) is making a strong resurgence in tune with the massive generation of unstructured data which in turn requires massive computational resources. Due to the inherently compute- and power-intensive structure of Neural Networks (NNs), hardware accelerators emerge as a promising solution. However, with technology node scaling below 10nm, hardware accelerators become more susceptible to faults, which in turn can impact the NN accuracy. In this paper, we study the resilience aspects of Register-Transfer Level (RTL) model of NN accelerators, in particular, fault characterization and mitigation. By following a High-Level Synthesis (HLS) approach, first, we characterize the vulnerability of various components of RTL NN. We observed that the severity of faults depends on both i) application-level specifications, i.e., NN data (inputs, weights, or intermediate), NN layers, and NN activation functions, and ii) architectural-level specifications, i.e., data representation model and the parallelism degree of the underlying accelerator. Second, motivated by characterization results, we present a low-overhead fault mitigation technique that can efficiently correct bit flips, by 47.3% better than state-of-the-art methods.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. ML-based Fault Injection for Autonomous Vehicles: A Case for Bayesian Fault Injection

    cs.LG 2019-07 unverdicted novelty 6.0

    DriveFI, a Bayesian ML-based fault injection engine, identifies 561 safety-critical faults in AV systems in under 4 hours on NVIDIA and Baidu stacks, while random injection over weeks found none.

  2. Kayotee: A Fault Injection-based System to Assess the Safety and Reliability of Autonomous Vehicles to Faults and Errors

    cs.SE 2019-07 unverdicted novelty 4.0

    Kayotee is a fault injection-based tool and ontology model for assessing the safety and reliability of autonomous vehicles to faults and errors at hardware, software, vehicle, and traffic levels.