GPU Porting of Scalable Implicit Solver with Green’s Function-Based Neural Networks by OpenACC

Accelerator Programming Using Directives(2022)

引用 0|浏览24
暂无评分
摘要
With the development of diverse computer architectures and diverse HPC applications, it is desirable to make performance portable applications that run on multiple architectures with relatively low development cost. Directive based programming models such as OpenACC have been developed for such purpose, and have been used successfully to port many equation-based HPC applications. As an example of porting of a class of HPC applications comprising both data-analytics methods and equation-based methods, we port an implicit solver with a neural network (NN)-type preconditioner for solving large-scale partial differential equation (PDE)-based problems. The scalable preconditioner is based on the Green’s functions reflecting properties of the target PDE, which improves the accuracy and efficiency of using NNs for solving PDE-based problems. By kernel algorithm design suitable for the computer architecture and use of OpenACC, we enabled high performance on recent GPUs with relatively low development cost. Here, 64.4% of FP64 peak was obtained on NVIDIA A100 GPU-equipped nodes of AI Bridging Cloud Infrastructure at National Institute of Advanced Industrial Science and Technology, leading to 2.54-fold speedup from a highly-tuned GPU implementation of a widely used PDE solver algorithm and 38.9-fold speedup from OpenMP-based CPU implementation running on the same system. Furthermore, 83.4% weak scalability was obtained from 8 to 256 A100 GPUs on the same system, enabling solving large scale problems of up to 25.7 billion degrees-of-freedom with high performance.
更多
查看译文
关键词
Performance portability, OpenACC, GPU computation, Implicit solver, Neural network-based preconditioning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要