Improvements to the structural simulation toolkit
Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques
A low impact flow control implementation for offload communication interfaces
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Design, implementation, and performance evaluation of MPI 3.0 on portals 4.0
Proceedings of the 20th European MPI Users' Group Meeting
Hi-index | 0.00 |
Low latency collective communications are key to application scalability. As systems grow larger, minimizing collective communication time becomes increasingly challenging. Offload is an effective technique for accelerating collective operations, however, algorithms for collective communication constantly evolve such that flexible implementations are critical. This paper presents triggered operations--a semantic building block that allows the key components of collective communications to be offloaded while allowing the host side software to define the algorithm. Simulations are used to demonstrate the performance improvements achievable through the offload of MPI_Allreduce using these building blocks.