Performance Optimisation of Smoothed Particle Hydrodynamics Algorithms for Multi/Many-Core Architectures

2017 International Conference on High Performance Computing & Simulation (HPCS)(2017)

引用 4|浏览16
暂无评分
摘要
We describe a strategy for code modernisation of Gadget, a widely used community code for computational astrophysics. The focus of this work is on node-level performance optimisation, targeting current multi/many-core Intel® architectures. We identify and isolate a sample code kernel, which is representative of a typical Smoothed Particle Hydrodynamics (SPH) algorithm. The code modifications include threading parallelism optimisation, change of the data layout into Structure of Arrays (SoA), auto-vectorisation and algorithmic improvements in the particle sorting. We obtain shorter execution time and improved threading scalability both on Intel Xeon® (2.6× on Ivy Bridge) and Xeon Phi™ (13.7× on Knights Corner) systems. First few tests of the optimised code result in 19.1× faster execution on second generation Xeon Phi (Knights Landing), thus demonstrating the portability of the devised optimisation solutions to upcoming architectures.
更多
查看译文
关键词
Performance optimisation,SPH,OpenMP,vectorisation,Intel Xeon,Intel Xeon Phi,KNC,KNL
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要