Trident-CIM: A LUT-Based Compute-in-Memory Macro With Trident Read Bit-Line and Partial Product Pruning

IEEE Transactions on Circuits and Systems II: Express Briefs (2024)

Abstract
The massive data transfer required by deep neural network (DNN) models hinders their deployment in energy-constrained scenarios; compute-in-memory (CIM) architectures improve energy efficiency by coupling memory and logic. The read-out computing logic and the adder tree are the main contributors to overall CIM power consumption. This paper proposes Trident-CIM, a macro that implements LUT-based multiply-and-accumulate (MAC) computation. Read-out cells with trident read bit-lines allow read-out power to be shared among several memory rows. In addition, distribution-aware partial product pruning, implemented with leading-one-bit detection and truncation, reduces overall MAC power. A 16 kb Trident-CIM macro is implemented in 28 nm CMOS technology to validate energy efficiency. Simulation results show that the proposed LUT-based MAC scheme with partial product pruning achieves an energy efficiency of 40.6 TOPS/W at 200 MHz and 37.9 TOPS/W at 400 MHz with INT8 data precision.
Keywords
Compute-in-memory, read-out power reduction, partial product pruning, adder tree optimization
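
The abstract names two ideas, LUT-based MAC and partial product pruning via leading-one-bit detection and truncation, without giving circuit-level detail. The following is a minimal behavioral sketch of how those ideas could fit together in software, not the paper's implementation. Everything specific here is an assumption made for illustration: the 4-bit LUT granularity, the unsigned 8-bit activations, the `keep_bits` parameter, and the function names `build_lut`, `truncate_after_leading_one`, and `lut_mac` are all hypothetical.

```python
def build_lut(weight: int, bits: int = 4) -> list[int]:
    """Precompute weight * x for every unsigned `bits`-wide operand x.

    Assumption: one LUT per weight, indexed by a 4-bit slice of the activation.
    """
    return [weight * x for x in range(1 << bits)]


def truncate_after_leading_one(value: int, keep_bits: int) -> int:
    """Keep the leading one and the next keep_bits - 1 bits; zero the rest.

    Assumption: pruning acts on the magnitude of each partial product.
    """
    magnitude = abs(value)
    if magnitude == 0:
        return 0
    # bit_length() locates the leading one; lower-order bits are dropped.
    drop = max(magnitude.bit_length() - keep_bits, 0)
    pruned = (magnitude >> drop) << drop
    return pruned if value >= 0 else -pruned


def lut_mac(weights, activations, keep_bits=None):
    """Accumulate weight * activation using two 4-bit LUT reads per product.

    weights: signed INT8 values; activations: unsigned 8-bit values (assumed).
    keep_bits=None disables pruning and yields the exact dot product.
    """
    acc = 0
    for w, a in zip(weights, activations):
        lut = build_lut(w)
        low, high = a & 0xF, (a >> 4) & 0xF
        # Two partial products: low nibble, and high nibble shifted by 4.
        partials = [lut[low], lut[high] << 4]
        if keep_bits is not None:
            partials = [truncate_after_leading_one(p, keep_bits) for p in partials]
        acc += sum(partials)
    return acc


weights = [3, -5, 7]
activations = [17, 200, 9]
exact = lut_mac(weights, activations)                 # no pruning
pruned = lut_mac(weights, activations, keep_bits=4)   # pruned partial products
```

With these inputs the unpruned result matches the ordinary dot product, while the pruned result differs slightly because low-order bits of some partial products are discarded. In hardware, the intended benefit of such truncation would be fewer active bits entering the adder tree; this sketch models only the arithmetic effect, not power.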