HTVM: Efficient Neural Network Deployment On Heterogeneous TinyML Platforms.

Josse Van Delm, Maarten Vandersteegen,Alessio Burrello,Giuseppe Maria Sarda,Francesco Conti,Daniele Jahier Pagliari,Luca Benini,Marian Verhelst

DAC（2023）

引用 2|浏览11

暂无评分

摘要

Optimal deployment of deep neural networks (DNNs) on state-of-the-art Systems-on-Chips (SoCs) is crucial for tiny machine learning (TinyML) at the edge. The complexity of these SoCs makes deployment non-trivial, as they typically contain multiple heterogeneous compute cores with limited, programmer-managed memory to optimize latency and energy efficiency. We propose HTVM - a compiler that merges TVM with DORY to maximize the utilization of heterogeneous accelerators and minimize data movements. HTVM allows deploying the MLPerfT Tiny suite on DIANA, an SoC with a RISC-V CPU, and digital and analog compute-in-memory AI accelerators, at 120x improved performance over plain TVM deployment.

查看译文

关键词

Compilers,Convolutional Neural Networks,Heterogeneous Computing,Deep Learning Accelerators

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要