HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms

Alessio Burrello; Daniele Jahier Pagliari; Francesco Conti; Giuseppe Maria Sarda; Josse Van Delm; Luca Benini; Maarten Vandersteegen; Marian Verhelst

arxiv: 2406.07453 · v1 · pith:ICZDRFHZnew · submitted 2024-06-11 · 💻 cs.PL · cs.DC

HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms

Josse Van Delm , Maarten Vandersteegen , Alessio Burrello , Giuseppe Maria Sarda , Francesco Conti , Daniele Jahier Pagliari , Luca Benini , Marian Verhelst This is my paper

classification 💻 cs.PL cs.DC

keywords deploymentheterogeneoushtvmacceleratorsneuralsocstinytinyml

0 comments

read the original abstract

Optimal deployment of deep neural networks (DNNs) on state-of-the-art Systems-on-Chips (SoCs) is crucial for tiny machine learning (TinyML) at the edge. The complexity of these SoCs makes deployment non-trivial, as they typically contain multiple heterogeneous compute cores with limited, programmer-managed memory to optimize latency and energy efficiency. We propose HTVM - a compiler that merges TVM with DORY to maximize the utilization of heterogeneous accelerators and minimize data movements. HTVM allows deploying the MLPerf(TM) Tiny suite on DIANA, an SoC with a RISC-V CPU, and digital and analog compute-in-memory AI accelerators, at 120x improved performance over plain TVM deployment.

This paper has not been read by Pith yet.

HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms

discussion (0)