An Evaluation of Two Implementation Strategies for Optimizing One-Sided Atomic Reduction

  • Authors:
  • Jarek Nieplocha; Vinod Tipparaju; Edoardo Apra

  • Affiliations:
  • Pacific Northwest National Laboratory;Pacific Northwest National Laboratory;Pacific Northwest National Laboratory

  • Venue:
  • IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9 - Volume 10
  • Year:
  • 2005

Abstract

One-sided atomic reduction, also known as the accumulate operation, atomically combines the contents of a local buffer with data at a remote memory location. This operation is included in the MPI-2 standard as MPI_Accumulate. This paper discusses two strategies for implementing one-sided atomic reduction, called owner-computes and caller-computes. The performance of the two schemes was investigated on HP Alphaserver SC45 and HP zx-2600 clusters, both equipped with the Quadrics Elan-3 network.
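
To make the distinction between the two strategies concrete, the following is a minimal single-machine sketch in Python that uses threads in place of processes. It is not the authors' implementation, and the abstract does not spell out the mechanics. The sketch assumes that caller-computes means the calling side locks the target, fetches the remote data, performs the reduction itself, and writes the result back, and that owner-computes means the caller ships its buffer to the owner of the memory, which applies the reduction. All names (`RemoteMemory`, `accumulate_caller_computes`, `accumulate_owner_computes`, `owner_loop`, `run`) are hypothetical, and the reduction is fixed to element-wise addition.

```python
import queue
import threading


class RemoteMemory:
    """Hypothetical stand-in for a memory region owned by one process."""

    def __init__(self, size):
        self.data = [0.0] * size
        self.lock = threading.Lock()      # used only by caller-computes
        self.requests = queue.Queue()     # used only by owner-computes


def accumulate_caller_computes(mem, offset, buf):
    """Assumed caller-computes scheme: lock, get, reduce locally, put, unlock."""
    with mem.lock:
        remote = mem.data[offset:offset + len(buf)]           # "get"
        combined = [r + b for r, b in zip(remote, buf)]       # caller reduces
        mem.data[offset:offset + len(buf)] = combined         # "put"


def accumulate_owner_computes(mem, offset, buf):
    """Assumed owner-computes scheme: send the buffer, the owner reduces."""
    mem.requests.put((offset, list(buf)))


def owner_loop(mem):
    """Owner side: apply requests one at a time, which makes each update atomic."""
    while True:
        request = mem.requests.get()
        if request is None:               # shutdown sentinel
            break
        offset, buf = request
        for i, value in enumerate(buf):
            mem.data[offset + i] += value


def run(strategy, n_callers=4, n_updates=100, size=8):
    """Have several callers accumulate a buffer of ones into the same region."""
    mem = RemoteMemory(size)
    owner = None
    if strategy is accumulate_owner_computes:
        owner = threading.Thread(target=owner_loop, args=(mem,))
        owner.start()

    def caller():
        for _ in range(n_updates):
            strategy(mem, 0, [1.0] * size)

    callers = [threading.Thread(target=caller) for _ in range(n_callers)]
    for t in callers:
        t.start()
    for t in callers:
        t.join()

    if owner is not None:
        mem.requests.put(None)            # all requests are queued before this
        owner.join()
    return mem.data
```

Calling `run(accumulate_caller_computes)` and `run(accumulate_owner_computes)` should leave the same values in the target region, since both schemes serialize the updates; they differ in which side does the arithmetic and in how much data crosses the (here simulated) network.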