A scalable queue for work distribution on GPUs.

Bernhard Kerbl,Jörg Müller,Michael Kenzel,Dieter Schmalstieg,Markus Steinberger

PPOPP（2018）

引用 1|浏览93

暂无评分

摘要

Harnessing the power of massively parallel devices like the graphics processing unit (GPU) is difficult for algorithms that show dynamic or inhomogeneous workloads. To achieve high performance, such advanced algorithms require scalable, concurrent queues to collect and distribute work. We present a new concurrent work queue, the Broker Queue, a highly efficient, linearizable queue for fine-granular work distribution on the GPU. We evaluate its usability and benefits in contrast to existing queuing algorithms. Our queue is up to one order of magnitude faster than non-blocking queues, and outperforms simpler queue designs that are unfit for fine-granular work distribution.

查看译文

关键词

GPU, concurrent, parallel, queuing, scheduling

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要