Taming Parallel I/O Complexity With Auto-Tuning

Babak Behzad, Huong Vu Thanh Luu,Joseph Huchette,Surendra Byna, Prabhat, Ruth Aydt,Quincey Koziol,Marc Snir

SC13: International Conference for High Performance Computing, Networking, Storage and Analysis Denver Colorado November, 2013(2013)

引用 145|浏览87
暂无评分
摘要
We present an auto-tuning system for optimizing I/O performance of HDF5 applications and demonstrate its value across platforms, applications, and at scale. The system uses a genetic algorithm to search a large space of tunable parameters and to identify effective settings at all layers of the parallel I/O stack. The parameter settings are applied transparently by the auto-tuning system via dynamically intercepted HDF5 calls.To validate our auto-tuning system, we applied it to three I/O benchmarks (VPIC, VORPAL, and GCRM) that replicate the I/O activity of their respective applications. We tested the system with different weak-scaling configurations (128, 2048, and 4096 CPU cores) that generate 30 GB to 1 TB of data, and executed these configurations on diverse HPC platforms (Cray XE6, IBM BC/P, and Dell Cluster). In all cases, the auto-tuning framework identified tunable parameters that substantially improved write performance over default system settings. We consistently demonstrate I/O write speedups between 2x and 100x for test configurations.
更多
查看译文
关键词
computational complexity,genetic algorithms,input-output programs,parallel processing,program testing,HDF5 applications,IO write speedups,auto-tuning system,default system settings,diverse HPC platforms,dynamically intercepted HDF5 calls,genetic algorithm,parallel IO complexity,parallel IO stack,test configurations,tunable parameters,weak-scaling configurations,Auto-Tuning,Parallel I/O,Parallel file systems,Performance Optimization,
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要