Assessing the performance and scalability of a novel multilevel k-nomial allgather on CORE-Direct systems

  • Authors:
  • Joshua S. Ladd;Manjunath Gorentla Venkata;Richard Graham;Pavel Shamis

  • Affiliations:
  • Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN;Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN;Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN;Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN

  • Venue:
  • Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a novel allgather algorithm, Reindexed Recursive K-ing (RRK), which leverages flexibility in the algorithm's tree topology and ability to make asynchronous progress coupled with Core-Direct communication offload capability to optimize the MPI_Allgather for Core-Direct enabled systems. In particular, the RRK introduces a reindexing scheme which ensures contiguous data transfers while adding only a single additional send and receive operation for any radix, k, or communicator size, N. This allows us to improve algorithm scalability by avoiding the use of a scatter/gather elements (SGE) list on InfiniBand networks. The implementations of the RRK algorithm and its evaluation shows that it performs and scales well on Core-Direct systems for a wide range of message sizes and various communicator configurations.