Learning Flexible GEMM Accelerator Configuration and Mapping-space using ML

Ananda Samajdar, Michael Pellauer

semanticscholar (2022)

Abstract
The value of flexibility in Deep Learning accelerators to adapt to diverse layer shapes and sizes is well understood. Contemporary reconfigurable architectures depend on compilers or other components in the software stack to search for the optimal configuration and mapping in order to fully exploit the benefits of flexibility. In this paper we show that the configuration and mapping space of flexible accelerators can be learnt using machine learning by casting it as a classification or recommendation problem. The learnt model can be used to obtain the optimal configuration of the target accelerator in constant time, without search. We propose ADAPTNET, a recommender system for obtaining the optimal configuration and mapping for GEMM workloads running on a Reconfigurable Systolic Array (RSA). The RSA is designed to be configured such that it can operate across a spectrum from a single monolithic array to a distributed collection of smaller arrays of various sizes with flexible aspect ratios. This allows us to simultaneously achieve scalability and high mapping flexibility while preserving operand reuse. ADAPTNET demonstrates 95% test accuracy compared to an exhaustively searched optimal configuration, beating state-of-the-art classification techniques such as SVMs, XGBoost and MLPs. We also present ADAPTNETX, a specialized core to run ADAPTNET in hardware. Together, RSA and ADAPTNETX enable us to demonstrate a new class of flexible accelerators which are capable of self-configuring in hardware for the given GEMM workload. We present a 32.768-TOPS instance called SAGAR that is capable of providing the same mapping flexibility as a compute-equivalent distributed system while achieving 3.5× more power efficiency and 3.2
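To illustrate the framing the abstract describes, the following is a minimal sketch of casting accelerator configuration selection as a classification problem. Everything here is a hypothetical stand-in: the features (GEMM shape M, N, K), the label encoding (an index into a fixed set of RSA configurations), the synthetic oracle, and the scikit-learn MLPClassifier used in place of ADAPTNET. It shows only the problem setup, not the paper's actual model or configuration space.

```python
# Hypothetical sketch: configuration selection as classification.
# Not the paper's ADAPTNET model; all encodings below are illustrative.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)

# Features: GEMM workload shape (M, N, K).
# Labels: index of the "best" configuration out of n_configs choices,
# as an exhaustive search would produce for the training set.
n_samples, n_configs = 5000, 16
X = rng.integers(1, 4096, size=(n_samples, 3)).astype(np.float64)

# Stand-in oracle: pretend the optimal configuration depends on the
# GEMM's aspect ratio and total work (purely synthetic, for demo only).
aspect = np.log2(X[:, 0] / X[:, 1])
work = np.log2(X.prod(axis=1))
y = (np.digitize(aspect, np.linspace(-4, 4, 4)) * 4
     + np.digitize(work, np.linspace(10, 34, 4))) % n_configs

# Log-scale the shape features so sizes spanning orders of magnitude
# are comparable, then train a small classifier.
X_train, X_test, y_train, y_test = train_test_split(
    np.log2(X), y, test_size=0.2, random_state=0)

clf = MLPClassifier(hidden_layer_sizes=(64, 64), max_iter=1000,
                    random_state=0).fit(X_train, y_train)
print(f"test accuracy vs. oracle labels: {clf.score(X_test, y_test):.3f}")
```

Once trained, such a model returns a configuration for a new GEMM shape with a single forward pass, which is what makes the constant-time, search-free claim in the abstract plausible; the paper's contribution is a learned recommender accurate enough (95% on real labels) to replace the exhaustive search outright.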