Analyzing Performance Of Multi-Cores And Applications With Cache-Aware Roofline Model

2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS)(2017)

引用 0|浏览26
暂无评分
摘要
To satisfy growing computational demands of modern applications, significant enhancements have been introduced in the contemporary processor architectures with the aim to increase their attainable performance, such as increased number of cores, improved capability of memory subsystem and enhancements in the processor pipeline [1]. Therefore, the performance improvements are usually coupled with an increased complexity at the architecture level, which imposes additional challenges when designing, prototyping and optimizing the execution of real-world applications on a given compute platform. Since the application performance depends on multiple factors, e.g., multi-threading, vectorization efficiency and memory accesses, achieving the most efficient execution is not a trivial task, especially when aiming at fully exploiting the capabilities of modern multi-core processors.
更多
查看译文
关键词
multicores performance analysis,cache-aware roofline model,computational demands,contemporary processor architectures,memory subsystem,architecture level,compute platform,application performance,multithreading,vectorization efficiency,memory accesses,multicore processors
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要