Flexible Software Profiling Of Gpu Architectures

ISCA(2015)

引用 127|浏览161
暂无评分
摘要
To aid application characterization and architecture design space exploration, researchers and engineers have developed a wide range of tools for CPUs, including simulators, profilers, and binary instrumentation tools. With the advent of GPU computing, GPU manufacturers have developed similar tools leveraging hardware profiling and debugging hooks. To date, these tools are largely limited by the fixed menu of options provided by the tool developer and do not offer the user the flexibility to observe or act on events not in the menu. This paper presents SASSI (NVIDIA assembly code "SASS" Instrumentor), a low-level assembly-language instrumentation tool for GPUs. Like CPU binary instrumentation tools, SASSI allows a user to specify instructions at which to inject user-provided instrumentation code. These facilities allow strategic placement of counters and code into GPU assembly code to collect user-directed, fine-grained statistics at hardware speeds. SASSI instrumentation is inherently parallel, leveraging the concurrency of the underlying hardware. In addition to the details of SASSI, this paper provides four case studies that show how SASSI can be used to characterize applications and explore the architecture design space along the dimensions of instruction control flow, memory systems, value similarity, and resilience.
更多
查看译文
关键词
flexible software profiling,GPU architectures,application characterization,architecture design space exploration,simulators,profilers,GPU computing,GPU manufacturers,hardware profiling,debugging hooks,SASSI,NVIDIA assembly code,SASS instrumentor,low-level assembly-language instrumentation tool,CPU binary instrumentation tools,user-provided instrumentation code,counters,GPU assembly code,user-directed fine-grained statistics,hardware speeds,instruction control flow,memory systems,value similarity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要