FFTs and multiple collective communication on multiprocessor-node architectures

  • Authors:
  • Andreas Jocksch

  • Affiliations:
  • CSCS, Swiss National Supercomputing Centre, Manno, Switzerland

  • Venue:
  • PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider FFTs for networks with multiprocessor nodes using 2D data decomposition. In this application, processors perform collective all-to-all communication in different groups independently at the same time. Thus the individual processors of the nodes might be involved in independent collective communication. The underlying communication algorithm should account for that fact. For short messages, we propose a sparse version of Bruck's algorithm which handles such multiple collectives. The distribution of the FFT data to the nodes is discussed for the local and global application of Bruck's original algorithm, as well as the suggested sparse version. The performance of the different approaches is compared.