Batched QR and SVD Algorithms on GPUs with Applications in Hierarchical Matrix Compression.

Wajih Halim Boukaram,George Turkiyyah,Hatem Ltaief,David E. Keyes

Parallel Computing（2018）

引用 48|浏览41

暂无评分

摘要

•High performance GPU hosted batched QR decomposition kernels are developed and outperform current implementations for small and rectangular matrices.•Various GPU hosted batched singular value decomposition kernels are developed and used as building blocks of a batched randomized SVD kernel for numerically low rank matrix blocks.•Batched QR, SVD, and GEMM kernels are used to compress hierarchical matrices entirely on the GPU.

查看译文

关键词

GPU,QR,SVD,Batched operations,Hierarchical,Compression

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要