Parallel zero-copy algorithms for fast Fourier transform and conjugate gradient using MPI datatypes

  • Authors:
  • Torsten Hoefler; Steven Gottlieb

  • Affiliations:
  • National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL;National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL

  • Venue:
  • EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
  • Year:
  • 2010

Abstract

Many parallel applications need to communicate noncontiguous data. Most applications manually copy (pack/unpack) the data before communication, even though MPI allows a zero-copy specification. In this work, we study two complex use cases: (1) a Fast Fourier Transform (FFT), where we express a local memory transpose as part of the datatype, and (2) a conjugate gradient solver with a checkerboard layout that requires multiple nested datatypes. We demonstrate significant speedups in both cases: up to a factor of 3.8 for the FFT and up to 18% for the conjugate gradient solver. Application developers can use our work as a template for utilizing datatypes. For MPI implementers, we show two practically relevant access patterns that deserve special optimization.
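
The idea of folding a local transpose into a datatype can be illustrated with a small model. The abstract does not give the authors' actual datatype construction, so the following is only a sketch under assumptions: it makes no MPI calls and instead models, in plain Python, the element offsets that a strided type (as built by MPI_Type_vector) with a shrunken extent (as set by MPI_Type_create_resized) would describe. The helper names `vector_offsets`, `unpack`, and `transpose_on_receive` are hypothetical and chosen for illustration.

```python
def vector_offsets(count, blocklength, stride):
    """Element offsets covered by one instance of a vector-style datatype:
    `count` blocks of `blocklength` elements, block starts `stride` apart."""
    return [i * stride + j for i in range(count) for j in range(blocklength)]


def unpack(stream, offsets, extent, instances, size):
    """Place a contiguous incoming stream into a buffer of `size` elements.
    Instance k of the datatype starts at element k * extent, mirroring how
    consecutive datatype instances are laid out by their extent."""
    buf = [None] * size
    it = iter(stream)
    for k in range(instances):
        for off in offsets:
            buf[k * extent + off] = next(it)
    return buf


def transpose_on_receive(matrix_flat, n):
    """Receive a row-major n x n matrix so that it lands transposed.
    One 'column' type touches every n-th element; resizing its extent to a
    single element makes the next instance start one element to the right."""
    column = vector_offsets(count=n, blocklength=1, stride=n)
    return unpack(matrix_flat, column, extent=1, instances=n, size=n * n)


if __name__ == "__main__":
    n = 3
    src = list(range(n * n))          # rows: [0,1,2], [3,4,5], [6,7,8]
    print(transpose_on_receive(src, n))
```

In this model the receiver never performs a separate copy to transpose the data: the layout description alone sends row k of the incoming matrix to column k of the destination, which is the sense in which a transpose can be part of the datatype.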