A Performance Model for GPUs with Caches
IEEE Transactions on Parallel and Distributed Systems (2015)
Abstract
To exploit the abundant computational power of the world's fastest supercomputers, an even workload distribution to the typically heterogeneous compute devices is necessary. While relatively accurate performance models exist for conventional CPUs, accurate performance estimation models for modern GPUs do not exist. This paper presents two accurate models for modern GPUs: a sampling-based linear mo...
Keywords
Graphics processing units, Kernel, Computational modeling, Computer architecture, Hardware, Data models, Estimation