Optimization strategy for a performance portable Vlasov code

2021 International Workshop on Performance, Portability and Productivity in HPC (P3HPC)(2021)

引用 1|浏览1
暂无评分
摘要
This paper presents optimization strategies applied on a kinetic plasma simulation code that makes use of Ope-nACC/OpenMP directives and Kokkos performance portable framework to run across multiple CPUs and GPUs. We evaluate the impacts of optimizations on multiple hardware platforms: Intel Xeon Skylake, Fujitsu Arm A64FX, and Nvidia Tesla P100 and V100. With vectorization and cache tuning, the Op...
更多
查看译文
关键词
Intel Skylake,Nvidia P100,Nvidia V100,Fujitsu A64FX,Semi-Lagrangian,Kokkos,OpenACC,OpenMP
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要