Leveraging Hierarchical Data Locality in Parallel Programming Models

High Performance Computing and Communications, 2014 IEEE 6th Intl Symp Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf Embedded Software and Syst(2014)

Abstract
We propose a novel framework that improves locality-aware parallel programming models by defining a hierarchical data locality model extension. We also propose a hierarchical thread partitioning algorithm, which synthesizes hierarchical thread placement layouts that aim to minimize the program's overall communication costs. We demonstrate the effectiveness of our approach on the NAS Parallel Benchmarks implemented in the Unified Parallel C (UPC) language, using a modified Berkeley UPC compiler and runtime system. Applying the placement layout suggested by our algorithm improved performance by up to 85%.
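The abstract does not spell out the partitioning algorithm, so the following is only an illustrative sketch of the general idea: given a thread-to-thread communication matrix, group heavily communicating threads at each level of a machine hierarchy so that less traffic crosses group boundaries. The greedy heuristic and all names here (`partition`, `hierarchical_layout`, `remote_traffic`, the `comm` matrix, the `sizes` list) are assumptions made for illustration, not the paper's method or any Berkeley UPC API.

```python
def partition(comm, threads, size):
    """Greedily pack `threads` into groups of at most `size` members.

    comm[a][b] is the (hypothetical) traffic volume sent from thread a to b.
    This is a simple greedy heuristic, not the algorithm from the paper.
    """
    remaining = sorted(threads)
    groups = []
    while remaining:
        # Seed each group with the thread that talks most to the remaining ones.
        seed = max(remaining,
                   key=lambda t: sum(comm[t][u] + comm[u][t] for u in remaining))
        remaining.remove(seed)
        group = [seed]
        while len(group) < size and remaining:
            # Add the thread with the heaviest traffic to the current group.
            best = max(remaining,
                       key=lambda t: sum(comm[t][g] + comm[g][t] for g in group))
            remaining.remove(best)
            group.append(best)
        groups.append(sorted(group))
    return groups


def hierarchical_layout(comm, threads, sizes):
    """Apply `partition` level by level, e.g. sizes=[4, 2] for node then socket."""
    if not sizes:
        return sorted(threads)
    return [hierarchical_layout(comm, group, sizes[1:])
            for group in partition(comm, threads, sizes[0])]


def remote_traffic(comm, groups):
    """Total traffic between threads that were placed in different groups."""
    owner = {t: i for i, group in enumerate(groups) for t in group}
    return sum(comm[a][b] for a in owner for b in owner if owner[a] != owner[b])
```

For a four-thread program in which threads 0 and 2 exchange most of the data, as do threads 1 and 3, `partition(comm, range(4), 2)` would pair those threads together, whereas a default blocked layout `[[0, 1], [2, 3]]` would send the heavy traffic across groups. `remote_traffic` gives a rough way to compare two candidate layouts under this simplified single-level cost model.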
Keywords
unified c language,communication costs,c language,nas parallel benchmarks,multi-threading,hierarchical data locality model extension,parallel languages,hierarchical thread partitioning algorithm,modified berkeley upc compiler,upc language,many-cores,locality-aware parallel programming model,hierarchical thread clustering,runtime system,mobile computing,data locality,hierarchical thread placement layouts,program compilers,data models,benchmark testing,measurement,kernel