A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems

  • Authors:
  • Jesper Larsson Träff;Andreas Ripke;Christian Siebert;Pavan Balaji;Rajeev Thakur;William Gropp

  • Affiliations:
  • NEC Laboratories Europe, NEC Europe Ltd., Sankt Augustin, Germany D-53757;NEC Laboratories Europe, NEC Europe Ltd., Sankt Augustin, Germany D-53757;NEC Laboratories Europe, NEC Europe Ltd., Sankt Augustin, Germany D-53757;Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, USA IL 60439;Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, USA IL 60439;Department of Computer Science, University of Illinois, Urbana, USA IL 61801

  • Venue:
  • Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year:
  • 2008

Abstract

We present and evaluate a new, simple, pipelined algorithm for large, irregular all-gather problems, useful for the implementation of the MPI_Allgatherv collective operation of MPI. The algorithm can be viewed as an adaptation of a linear ring algorithm for regular all-gather problems for single-ported, clustered multiprocessors to the irregular problem. Compared to the standard ring algorithm, whose performance is dominated by the largest data size broadcast by a process (times the number of processes), the performance of the new algorithm depends only on the total amount of data over all processes. The new algorithm has been implemented within different MPI libraries. Benchmark results on NEC SX-8, Linux clusters with InfiniBand and Gigabit Ethernet, Blue Gene/P, and SiCortex systems show huge performance gains in accordance with the expected behavior.
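
To illustrate why the largest contribution dominates the standard ring, the following minimal sketch counts data volume per communication round. It is a simplified cost model and not the algorithm from the paper. It assumes synchronous rounds in which every process forwards one whole block to its ring successor, so each round takes as long as its largest block. The function names (`ring_round_maxima`, `standard_ring_cost`, `total_data`) are hypothetical, and the total data volume is used only as a rough proxy for the behavior the abstract attributes to the new algorithm.

```python
def ring_round_maxima(sizes):
    """Standard ring all-gather with p processes (simplified model).

    In round k, process i forwards the block that originated at
    process (i - k) mod p. Under the assumption of synchronous rounds,
    each round lasts as long as the largest block sent in it.
    """
    p = len(sizes)
    maxima = []
    for k in range(p - 1):  # p - 1 rounds complete the ring
        sent = [sizes[(i - k) % p] for i in range(p)]
        maxima.append(max(sent))
    return maxima


def standard_ring_cost(sizes):
    """Data volume on the critical path of the standard ring (model only)."""
    return sum(ring_round_maxima(sizes))


def total_data(sizes):
    """Total data over all processes, used here as a proxy (assumption)."""
    return sum(sizes)


if __name__ == "__main__":
    sizes = [1000, 1, 1, 1]  # one large contribution, three small ones
    print(standard_ring_cost(sizes))  # 3000
    print(total_data(sizes))          # 1003
```

With one large and three tiny contributions, the modeled ring cost is (p - 1) times the largest block, while the total data volume is only slightly more than that one block. This gap is what the abstract describes as the potential gain for irregular problems.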