Efficient and truly passive MPI-3 RMA using InfiniBand atomics

Authors:
M. Li;S. Potluri;K. Hamidouche;J. Jose;D. K. Panda
Venue:
Proceedings of the 20th European MPI Users' Group Meeting
Year:
2013

Citing 3
Cited 0

Algorithms for scalable synchronization on shared-memory multiprocessors
ACM Transactions on Computer Systems (TOCS)

High Performance Distributed Lock Management Services using Network-based Remote Atomic Operations
CCGRID '07 Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid

Natively Supporting True One-Sided Communication in MPI on Multi-core Systems with InfiniBand
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid

Abstract

Multi/many-core architectures offer high compute density on modern supercomputing clusters. To achieve peak performance, applications must minimize communication and synchronization overheads. MPI offers one-sided communication semantics that are aimed at enabling this. In this paper, we propose a novel design for implementing truly passive shared and exclusive MPI_Win_lock/unlock using InfiniBand atomics. We address limitations in designs published earlier. We also present the impact of our design on MPI_Win_lock_all, introduced in MPI-3. We demonstrate superior overlap compared to existing two-sided implementations. Using the Splash LU kernel, the proposed design delivers up to 49% performance improvement compared to existing designs.
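
The abstract does not describe the lock protocol itself, so the following is only a minimal sketch of how shared and exclusive locks can be built from the two 64-bit atomic operations that InfiniBand provides, compare-and-swap and fetch-and-add. It is not the authors' design. C11 atomics on local memory stand in for the remote network atomics, and the lock-word layout (`EXCL_UNIT`), the type `win_lock_t`, and all function names are hypothetical choices made for illustration.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumed layout of the 64-bit lock word (illustrative only):
 * upper 32 bits = exclusive-holder flag, lower 32 bits = shared-holder count. */
#define EXCL_UNIT ((uint64_t)1 << 32)

typedef struct {
    _Atomic uint64_t word;   /* stands in for a lock word in the target's memory */
} win_lock_t;

/* Exclusive lock: one compare-and-swap from "completely free" to "exclusive". */
static bool try_lock_exclusive(win_lock_t *l) {
    uint64_t expected = 0;
    return atomic_compare_exchange_strong(&l->word, &expected, EXCL_UNIT);
}

/* Exclusive unlock: subtract the flag so that concurrent, transient
 * shared-count increments from other origins are preserved. */
static void unlock_exclusive(win_lock_t *l) {
    atomic_fetch_sub(&l->word, EXCL_UNIT);
}

/* Shared lock: optimistic fetch-and-add on the count; if the value read
 * back shows an exclusive holder, undo the increment and report failure. */
static bool try_lock_shared(win_lock_t *l) {
    uint64_t prev = atomic_fetch_add(&l->word, 1);
    if (prev >= EXCL_UNIT) {
        atomic_fetch_sub(&l->word, 1);
        return false;
    }
    return true;
}

/* Shared unlock: drop one reference from the shared count. */
static void unlock_shared(win_lock_t *l) {
    atomic_fetch_sub(&l->word, 1);
}
```

In this sketch every operation is issued by the origin alone and needs no action from the target process, which is the property the abstract calls "truly passive". A real implementation would also need a retry or backoff policy for failed attempts, which is omitted here.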