Pushing The Limits Of Accelerator Efficiency While Retaining Programmability

Proceedings of the 2016 IEEE International Symposium on High-Performance Computer Architecture (HPCA-22) (2016)

Abstract
The waning benefits of device scaling have caused a push towards domain-specific accelerators (DSAs), which sacrifice programmability for efficiency. While providing huge benefits, DSAs are prone to obsolescence due to domain volatility, have recurring design and verification costs, and have large area footprints when multiple DSAs are required in a single device. Because of the benefits of generality, this work explores how far a programmable architecture can be pushed, and whether it can come close to the performance, energy, and area efficiency of a DSA-based approach.

Our insight is that DSAs employ common specialization principles for concurrency, computation, communication, data reuse, and coordination, and that these same principles can be exploited in a programmable architecture using a composition of known microarchitectural mechanisms. Specifically, we propose and study an architecture called LSSD, which is composed of many tiny, low-power cores, each having a configurable spatial architecture, scratchpads, and DMA.

Our results show that a programmable, specialized architecture can indeed be competitive with a domain-specific approach. Compared to four prominent and diverse DSAs, LSSD can match the DSAs' 10x to 150x speedup over an out-of-order (OOO) core, with at most 4x more area and power than a single DSA, while retaining programmability.
Keywords
accelerator efficiency analysis, programmability, domain specific accelerators, domain volatility, design cost, verification cost, programmable architecture, performance analysis, energy analysis, area efficiency analysis, DSA-based approach, concurrency principle, computation principle, communication principle, data-reuse principle, coordination principle, microarchitectural mechanism, LSSD architecture, configurable spatial architecture, scratchpads, DMA, programmable-specialized architecture, domain-specific approach