On optimizing collective communication

  • Authors:
  • E. W. Chan;M. F. Heimlich;A. Purkayastha;R. A. van de Geijn

  • Affiliations:
  • Univ. of Texas, Austin, TX, USA;Univ. of Texas, Austin, TX, USA;Univ. of Texas, Austin, TX, USA;Univ. of Texas, Austin, TX, USA

  • Venue:
  • CLUSTER '04 Proceedings of the 2004 IEEE International Conference on Cluster Computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We discuss issues related to the high-performance implementation of collective communications operations on distributed-memory computer architectures. Using a combination of known techniques (many of which were first proposed in the 1980s and early 1990s) along with careful exploitation of communication modes supported by MPI, we have developed implementations that have improved performance in most situations compared to those currently supported by public domain implementations of MPI such as MPICH. Performance results from a large Intel Pentium 4 (R) processor cluster are included.