谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Increasing double precision throughput on NVIDIA Maxwell GPUs.

SpringSim (HPS)(2016)

引用 0|浏览12
暂无评分
摘要
This paper deals with the impact the architectural changes of modern GPUs have on their use in scientific computing. It particularly focuses on significant drops in the number of double precision functional units in NVIDIA Maxwell architecture. Proposed remedies of the potential negative impact on GPGPU applications that are based on multiple precision arithmetics are discussed. Two new algorithms for fast and precise multiplication and fused multiply add for double precision arithmetics emulation are also presented here. Using these methods, we were able to boost the double precision performance of NVIDIA GTX 980 Ti from 95 GFLOPS up to 286 GFLOPS. The proposed methods are applicable also to other GPUs.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要