Enabling Flexible Collective Communication Offload with Triggered Operations

  • Authors:
  • Keith D. Underwood;Jerrie Coffman;Roy Larsen;K. Scott Hemmert;Brian W. Barrett;Ron Brightwell;Michael Levenhagen

  • Affiliations:
  • -;-;-;-;-;-;-

  • Venue:
  • HOTI '11 Proceedings of the 2011 IEEE 19th Annual Symposium on High Performance Interconnects
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Low latency collective communications are key to application scalability. As systems grow larger, minimizing collective communication time becomes increasingly challenging. Offload is an effective technique for accelerating collective operations, however, algorithms for collective communication constantly evolve such that flexible implementations are critical. This paper presents triggered operations--a semantic building block that allows the key components of collective communications to be offloaded while allowing the host side software to define the algorithm. Simulations are used to demonstrate the performance improvements achievable through the offload of MPI_Allreduce using these building blocks.