Fast, good, and repeatable: Summations, vectorization, and reproducibility

Periodicals(2020)

引用 2|浏览4
暂无评分
摘要
AbstractEnhanced-precision global sums are key to reproducibility in exascale applications. We examine two classic summation algorithms and show that vectorized versions are fast, good and reproducible at exascale. Both 256-bit and 512-bit implementations speed up the operation by almost a factor of four over the serial version. They thus demonstrate improved performance on global summations while retaining the numerical reproducibility of these methods.
更多
查看译文
关键词
Reproducibility, vectorization, self-compensated summation, enhanced precision, reproducible sums
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要