High performance 3-D FFT using multiple CUDA GPUs

  • Authors:
  • Akira Nukada;Yutaka Maruyama;Satoshi Matsuoka

  • Affiliations:
  • Tokyo Institute of Technology;Institute for Molecular Science;Tokyo Institute of Technology/National Institute of Informatics

  • Venue:
  • Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Fast Fourier transform is one of the most important computations used in many kinds of applications. Although there are several works of on single GPU FFT, we also need large-scale transforms that require multiple GPUs due to the capacity of the device memory. We present high performance 3-D FFT using multiple GPU devices both on a single node and on multiple nodes. As a result of optimizing the data transfer between GPUs, our multi GPU FFT successfully outperform single GPU.