Precision-Aware Workload Distribution and Dataflow for a Hybrid Digital-CIM Deep CNN Accelerator

Jui-I Kao, Wei Lu,Po-Tsang Huang,Hung-Ming Chen

2022 19th International SoC Design Conference (ISOCC)(2022)

引用 0|浏览4
暂无评分
摘要
SRAM-based Computing-in-memory (CIM) circuits have been demonstrated as a promising solution to effectively accelerate the inference of convolutional neural networks (CNNs) by shifting computation into the memory arrays. However, the advantages of CIM accelerators will disappear as increasing the bit precision and adopting advanced process technology due to the overhead caused by ADC/DAC and poor technology scaling capability of analog circuits. In this paper, a hybrid digital-CIM accelerator was proposed to solve above problems and the weights and activations of different layers are quantized to different precision (high, medium, and low precision). Moreover, precision-aware workload distribution and dataflow are proposed for the hybrid digital-CIM accelerator. Overall, the proposed accelerator can achieve 12.481 TOPS/W.
更多
查看译文
关键词
hybrid digital-CIM,precision-aware
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要