OpenCL implementation of a high performance 3D Peridynamic model on graphics accelerators.

Farshid Mossaiby, A. Shojaei, M. Zaccariotto, U. Galvanetto

Computers & Mathematics with Applications (2017)

Abstract

Parallel processing is one of the major trends in the computational mechanics community. Due to inherent limitations in processor design, manufacturers have shifted towards multi- and many-core architectures. Graphics processing units (GPUs) are gaining popularity due to their high availability and processing power, as well as the maturity of development tools and community experience. In this research we describe a rather general approach to implementing a 3D Peridynamics model in OpenCL on GPU platforms. Peridynamics is a non-local continuum theory for describing material behavior, used especially when damage and crack nucleation or propagation are of interest. The steps taken to develop an OpenMP code from the serial one, as well as a comparison between the OpenCL and OpenMP codes, are provided. Optimization techniques and their effects on the performance of the code are described. The implementations are tested on several 3D benchmarks with hundreds of thousands to millions of nodes. The behavior of the codes in terms of being memory or compute bound is analyzed. In all test cases reported, the OpenCL implementation consistently outperforms the serial and OpenMP ones and paves the way for the development of high performance Peridynamics codes.

Keywords

Peridynamics, Parallel implementation, OpenCL, OpenMP, Graphics accelerators
