Analog-memory-based 14nm Hardware Accelerator for Dense Deep Neural Networks including Transformers.

ISCAS (2022)

Abstract
Analog non-volatile memory (NVM)-based accelerators for deep neural networks perform high-throughput and energy-efficient multiply-accumulate (MAC) operations (e.g., high TeraOPS/W) by exploiting massively parallel analog MAC operations, implemented with Ohm's law and Kirchhoff's current law on arrays of resistive devices. While the wide-integer and floating-point operations offered by conventional digital CMOS computing are far better suited than analog computing to conventional applications that require high accuracy and exact reproducibility, deep neural networks can still deliver competitive end-to-end results even with modest (e.g., 4-bit) precision in synaptic operations. In this paper, we describe a 14-nm inference chip, comprising multiple 512 × 512 arrays of Phase-Change Memory (PCM) devices, which can deliver software-equivalent inference accuracy on MNIST handwritten-digit recognition and recurrent LSTM benchmarks by using compensation techniques to finesse analog-memory challenges such as conductance drift and noise. We also project accuracy for Natural Language Processing (NLP) tasks performed with a state-of-the-art large Transformer-based model, BERT, when mapped onto an extended version of this same fundamental chip architecture.
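The core mechanism in the abstract, a dot product computed physically on a resistive crossbar, can be illustrated with a short numerical model. The sketch below is hypothetical: the 512 × 512 array size follows the paper's PCM tiles, but the drift exponent, noise level, and the scalar correction factor `alpha` are illustrative assumptions, not the chip's actual calibration procedure.

```python
import numpy as np

# Minimal numerical sketch of an analog MAC on a crossbar of resistive devices.
# Ohm's law: each device contributes current I = G * V; Kirchhoff's current law:
# currents on a shared column wire sum, so each column yields one dot product.
# Drift model G(t) = G0 * (t / t0)^-nu and the scalar compensation are
# illustrative assumptions, not the paper's exact techniques.

rng = np.random.default_rng(0)

def quantize_conductance(w, bits=4, g_max=1.0):
    """Map weights to conductances at modest (e.g., 4-bit) precision.

    Each polarity is quantized to 2**bits - 1 levels; negative values stand
    in for the negative device of a differential pair.
    """
    levels = 2 ** bits - 1
    return np.round(w / np.abs(w).max() * levels) / levels * g_max

def crossbar_mac(g, v, noise_std=0.01):
    """One fully parallel MAC: column currents = v @ G, plus read noise."""
    i_col = v @ g  # Kirchhoff summation of per-device Ohm's-law currents
    return i_col + rng.normal(0.0, noise_std, i_col.shape)

def drift(g0, t, t0=1.0, nu=0.05):
    """PCM conductance drift: G(t) = G0 * (t / t0)**-nu."""
    return g0 * (t / t0) ** -nu

# A 512 x 512 array, matching the tile size described in the paper.
w = rng.standard_normal((512, 512))
g0 = quantize_conductance(w)
v = rng.standard_normal(512)

# After drift, estimate a single global correction factor (hypothetical
# stand-in for the chip's compensation techniques) and rescale the output.
g_t = drift(g0, t=1e4)
alpha = np.abs(g0).mean() / np.abs(g_t).mean()
out = alpha * crossbar_mac(g_t, v)
```

Because the entire weight matrix is read out in one step of column-current summation, the MAC cost is amortized across all 512 columns at once, which is the source of the TeraOPS/W advantage the abstract cites.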
Keywords
Phase-change memory, non-volatile memory, inference, analog multiply-accumulate for DNNs, analog AI, deep learning accelerator, BERT, Transformer