Using shared-data localization to reduce the cost of inspector-execution in unified-parallel-C programs.

Parallel Computing (2016)

Abstract
• We improve the performance of fine-grain UPC applications by orders of magnitude.
• We introduce a novel shared-data localization transformation.
• We present a thorough performance analysis and evaluation.
• We show that reducing run-time calls is crucial for performance.
• We achieve performance comparable to C and MPI using the UPC programming model.
Keywords
Unified Parallel C, Partitioned global address space, Compiler optimization