谷歌浏览器插件
订阅小程序
在清言上使用

CFM: SIMT Thread Divergence Reduction by Melding Similar Control-Flow Regions in GPGPU Programs

arXiv (Cornell University)(2021)

引用 0|浏览0
暂无评分
摘要
GPGPUs use the Single-Instruction-Multiple-Thread (SIMT) execution model where a group of threads--wavefront or war--execute instructions in lockstep. When threads in a group encounter a branching instruction, not all threads in the group take the same path, a phenomenon known as control-flow divergence. The control-flow divergence causes performance degradation because both paths of the branch must be executed one after the other. Prior research has primarily addressed this issue through architectural modifications. We observe that certain GPGPU kernels with control-flow divergence have similar control-flow structures with similar instructions on both sides of a branch. This structure can be exploited to reduce control-flow divergence by melding the two sides of the branch allowing threads to reconverge early, reducing divergence. In this work, we present CFM, a compiler analysis and transformation framework that can meld divergent control-flow structures with similar instruction sequences. We show that CFM can reduce the performance degradation from control-flow divergence.
更多
查看译文
关键词
GPU Computing,Parallel Computing,Control-Flow Integrity,Multicore Architectures
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要