Space Efficient Sequence Alignment for SRAM-Based Computing: X-Drop on the Graphcore IPU

Luk Burchard,Max Xiaohang Zhao,Johannes Langguth,Aydın Buluç,Giulia Guidi

SC23: International Conference for High Performance Computing, Networking, Storage and Analysis（2023）

引用 0|浏览1

暂无评分

摘要

Dedicated accelerator hardware has become essential for processing AI-based workloads, leading to the rise of novel accelerator architectures. Furthermore, fundamental differences in memory architecture and parallelism have made these accelerators targets for scientific computing. The sequence alignment problem is fundamental in bioinformatics; we have implemented the X-Drop algorithm, a heuristic method for pairwise alignment that reduces search space, on the Graphcore Intelligence Processor Unit (IPU) accelerator. The X-Drop algorithm has an irregular computational pattern, which makes it difficult to accelerate due to load balancing. Here, we introduce a graph-based partitioning and queue-based batch system to improve load balancing. Our implementation achieves l Ox speedup over a state-of-the-art GPU implementation and up to 4.65x compared to CPU. In addition, we introduce a memory-restricted X-Drop algorithm that reduces memory footprint by 55x and efficiently uses the IPU's limited low-latency SRAM. This optimization further improves strong scaling by 3.6x.

查看译文

关键词

Sequence Alignment,Heuristic,Search Space,Graphics Processing Unit,Pairwise Alignment,Load Balancing,Hardware Accelerators,Alignment Problem,Strong Scaling,Sequence Length,Parallelization,Real-world Data,Local Alignment,Sparse Matrix,Upper Left Corner,Work Unit,Local Memory,Long-read Technologies,Dynamic Matrix,Single Tile,Anti-diagonal,Optimal Alignment,Entire Matrix,Total Execution Time,Graph Partitioning,Race Conditions,Single Alignment,Clock Frequency

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要