Chrome Extension
WeChat Mini Program
Use on ChatGLM

Prospects for scalable 3D FFTs on heterogeneous exascale systems

user-5dd528d2530c701191bf1b49(2011)

Cited 0|Views3
No score
Abstract
We consider the problem of implementing scalable threedimensional fast Fourier transforms with an eye toward future exascale systems comprised of graphics co-processor (GPUs) or other similarly high-density compute units. We describe a new software implementation; derive and calibrate a suitable analytical performance model; and use this model to make predictions about potential outcomes at exascale, based on current and likely technology trends. We evaluate the scalability of our software and instantiate models on real systems, including 64 nodes (192 NVIDIA“Fermi” GPUs) of the Keeneland system at Oak Ridge National Laboratory. We use our analytical model to quantify the impact of both interand intra-node communication that impede further scalability. Among various observations, a key prediction is that although inter-node all-to-all communication is expected to be the bottleneck of distributed FFTs, it is actually intra-node communication that may play an even more critical role.
More
Translated text
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined