Automatic Parallelization for Graphics Processing Units in JikesRVM

msra(2008)

引用 24|浏览4
暂无评分
摘要
Abstract Accelerated graphics cards, including specialized high-performance processors called Graphics Processing Units (GPUs), have become ubiquitous in recent years. On the right kinds of problems, GPUs greatly surpass CPUs in terms of raw performance. However, GPUs are currently used only for a narrow class of special-purpose appli- cations; the raw processing power available in a typical desktop PC is unused most of the time. The goal of this work is to present an extension to JikesRVM that automatically executes suitable code on the GPU instead of the CPU. Both static and dynamic features are used to decide whether it is feasible and benecial,to o-load,a piece of code to the GPU. Feasible code is discovered by an implementation of data dependence analysis. A cost model that balances the speedup available from the GPU against the cost of transferring input and output data between main memory and GPU memory,has been deployed to determine if a feasible parallelization is indeed benecial.,The cost model is parameterized so that it can be applied to dierent,hardware combinations. We also present ways to overcome several obstacles to parallelization inherent in the design of the Java bytecode language: unstructured control,ow, the lack of
更多
查看译文
关键词
java,compiler,automatic parallelization,optimization,virtual machine
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要