AIDA: Associative DNN Inference Accelerator

Leonid Yavits; Ran Ginosar; Roman Kaplan

arxiv: 1901.04976 · v1 · pith:GCPEJR7Pnew · submitted 2018-12-20 · 💻 cs.DC

AIDA: Associative DNN Inference Accelerator

Leonid Yavits , Roman Kaplan , Ran Ginosar This is my paper

classification 💻 cs.DC

keywords aidainferenceacceleratorassociativeacceleratingareaarithmeticarrays

0 comments

read the original abstract

We propose AIDA, an inference engine for accelerating fully-connected (FC) layers of Deep Neural Network (DNN). AIDA is an associative in-memory processor, where the bulk of data never leaves the confines of the memory arrays, and processing is performed in-situ. AIDA area and energy efficiency strongly benefit from sparsity and lower arithmetic precision. We show that AIDA outperforms the state of art inference accelerator, EIE, by 14.5x (peak performance) and 2.5x (throughput).

This paper has not been read by Pith yet.

AIDA: Associative DNN Inference Accelerator

discussion (0)