A 1W8R 20T SRAM Codebook for 20% Energy Reduction in Mixed-Precision Deep-Learning Inference Processor System.

AICAS (2023)

Abstract
This study introduces a 1W8R 20T multiport memory for codebook quantization in deep-learning processors. We fabricated the memory in a 40 nm process and achieved a read-access time of 2.75 ns with a power consumption of 2.7 pJ/byte. In addition, we used NVDLA, NVIDIA's open-source deep-learning accelerator, as a reference design and simulated it using the power figures measured from the fabricated memory. The resulting power and area reductions are 20.24% and 26.24%, respectively.
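The codebook scheme the abstract refers to replaces full-precision weights with short indices into a small shared table, so the multiport memory only needs to serve narrow index lookups at inference time. The sketch below is a minimal NumPy illustration of that idea, assuming a hypothetical 8-entry (3-bit-index) codebook and a simple quantile-based construction; the helper names and the construction method are illustrative assumptions, not the paper's method.

```python
# Minimal sketch of codebook (look-up-table) weight quantization.
# Assumes a hypothetical 8-entry codebook shared across a weight tensor;
# make_codebook / dequantize are illustrative names, not from the paper.
import numpy as np

def make_codebook(weights: np.ndarray, n_entries: int = 8):
    """Build a small codebook and map each weight to its nearest entry."""
    # Quantile-based entries as a stand-in for a trained codebook.
    quantiles = np.linspace(0.0, 1.0, n_entries)
    codebook = np.quantile(weights, quantiles).astype(np.float32)
    # Each weight is stored only as the index of its nearest codebook entry.
    indices = np.abs(weights[..., None] - codebook).argmin(axis=-1).astype(np.uint8)
    return codebook, indices

def dequantize(codebook: np.ndarray, indices: np.ndarray) -> np.ndarray:
    """Inference-side lookup: one small-memory read per stored index."""
    return codebook[indices]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.standard_normal((64, 64)).astype(np.float32)
    cb, idx = make_codebook(w, n_entries=8)  # 3-bit indices instead of 32-bit weights
    w_hat = dequantize(cb, idx)              # reconstructed weights fed to the MAC array
    print("mean abs error:", np.abs(w - w_hat).mean())
```

In hardware terms, the multiport (1-write, 8-read) organization lets several processing elements perform this index-to-value lookup concurrently from one small codebook array, which is where the reported energy and area savings would come from.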
Keywords
codebook, multiport memory, deep neural network