Parallel GPU architecture simulation framework exploiting work allocation unit parallelism

ISPASS (2013)

Abstract
GPU computing is at the forefront of high-performance computing, and its massively parallel architecture has strongly influenced current research on parallel software and hardware design. Numerous studies have therefore focused on using GPUs in various fields. However, research on GPU architectures is constrained by the lack of a suitable GPU simulator: previously proposed GPU simulators are not fast enough for advanced software and architecture studies. In this paper, we propose a new parallel simulation framework and a parallel simulation technique, called work-group parallel simulation, to improve simulation speed for modern many-core GPUs. The proposed framework divides the GPU architecture into parallel and shared components, and it determines which GPU components can be effectively parallelized while still working correctly in multithreaded simulation. In addition, the work-group parallel simulation technique boosts the performance of parallelized GPU simulation by eliminating synchronization overhead. Experimental results obtained using a simulator built on the proposed framework show that the technique achieves a speed-up of up to 4.15 over an existing sequential GPU simulator on an 8-core machine, while keeping cycle errors to a minimum.
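The split into parallel and shared components can be illustrated with a minimal Python sketch. It is not the paper's simulator: the names `SharedMemoryModel`, `simulate_work_group`, and `simulate_kernel`, the instruction labels, and the cycle costs are all hypothetical, chosen only to show the idea of simulating each work-group on private state and touching shared state as rarely as possible.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class SharedMemoryModel:
    """Hypothetical shared component: a lock-protected access counter."""

    def __init__(self):
        self._lock = threading.Lock()
        self.accesses = 0

    def record(self, count):
        # Shared state is only ever updated under the lock.
        with self._lock:
            self.accesses += count


def simulate_work_group(group_id, instructions, shared):
    """Simulate one work-group using only private state (assumed cycle costs)."""
    cycles = 0
    mem_ops = 0
    for op in instructions:
        if op == "mem":
            cycles += 4  # assumed memory latency, for illustration only
            mem_ops += 1
        else:
            cycles += 1  # assumed ALU latency, for illustration only
    # Touch the shared component once per work-group instead of once per
    # instruction, so threads rarely contend for the lock.
    shared.record(mem_ops)
    return group_id, cycles


def simulate_kernel(work_groups, num_threads=8):
    """Distribute work-groups over a pool of simulation threads."""
    shared = SharedMemoryModel()
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        futures = [
            pool.submit(simulate_work_group, gid, instrs, shared)
            for gid, instrs in enumerate(work_groups)
        ]
        cycles_per_group = dict(f.result() for f in futures)
    return cycles_per_group, shared.accesses


if __name__ == "__main__":
    groups = [["alu", "mem", "alu"], ["mem", "mem"], ["alu"]]
    print(simulate_kernel(groups))
```

Because CPython threads share a global interpreter lock, this sketch shows only the decomposition, not any speed-up; a real simulator would run the per-work-group loop in native threads.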
Keywords
many-core GPU, multithreaded simulation, shared component, parallelized GPU simulation performance, GPU computing, parallel component, work-group parallel simulation technique, GPU simulator, high-performance computing, synchronization overhead elimination, GPU component, parallel architectures, graphics processing units, multi-threading, multiprocessing systems, work allocation unit parallelism, hardware design, parallel software, parallel GPU architecture simulation framework, parallel architecture, synchronisation, computational modeling, process control, instruction sets