Efficient and Portable Workgroup Size Tuning.

IEEE Transactions on Parallel and Distributed Systems (2020)

Abstract
The performance of an OpenCL program is strongly influenced by both hardware and software attributes. To achieve superior performance, developers may leverage automatic performance tuning techniques to determine the optimal parameters on the target device. Although existing approaches have shown promising tuning results in their target scenarios, other requirements such as efficiency, portability,...
Keywords
Tuning, Performance evaluation, Kernel, Hardware, Indexes, Computational modeling, Graphics processing units