InfiniBand is becoming an important interconnect technology in high performance computing. Recent large-scale InfiniBand deployments have raised scalability questions in the HPC community. Open MPI, a new open-source implementation of the MPI standard targeted at production computing, provides several mechanisms to enhance InfiniBand scalability. Initial comparisons with MVAPICH, the most widely used InfiniBand MPI implementation, show similar performance but much better scalability characteristics. Specifically, small-message latency is improved by up to 10% in medium and large jobs, and memory usage per host is reduced by as much as a factor of three. In addition, Open MPI provides predictable latency that is close to optimal without sacrificing bandwidth performance.
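The small-message latency figures cited above are the kind of result typically obtained with a point-to-point ping-pong microbenchmark between two MPI ranks. The sketch below illustrates such a measurement; it is not code from the paper, and the message size and iteration count are illustrative assumptions.

/*
 * Minimal MPI ping-pong microbenchmark (illustrative sketch only).
 * Rank 0 and rank 1 exchange a small message repeatedly; the average
 * one-way latency is derived from the total round-trip time.
 * Build: mpicc pingpong.c -o pingpong ; run with 2 ranks: mpirun -np 2 ./pingpong
 */
#include <mpi.h>
#include <stdio.h>

#define ITERATIONS 1000
#define MSG_SIZE   8          /* small message, in bytes (assumed size) */

int main(int argc, char **argv)
{
    int rank;
    char buf[MSG_SIZE] = {0};

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    MPI_Barrier(MPI_COMM_WORLD);          /* synchronize before timing */
    double start = MPI_Wtime();

    for (int i = 0; i < ITERATIONS; i++) {
        if (rank == 0) {
            MPI_Send(buf, MSG_SIZE, MPI_CHAR, 1, 0, MPI_COMM_WORLD);
            MPI_Recv(buf, MSG_SIZE, MPI_CHAR, 1, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
        } else if (rank == 1) {
            MPI_Recv(buf, MSG_SIZE, MPI_CHAR, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            MPI_Send(buf, MSG_SIZE, MPI_CHAR, 0, 0, MPI_COMM_WORLD);
        }
    }

    double elapsed = MPI_Wtime() - start;
    if (rank == 0)
        printf("average one-way latency: %.3f us\n",
               elapsed / (2.0 * ITERATIONS) * 1e6);

    MPI_Finalize();
    return 0;
}

A benchmark of this shape reports half the averaged round-trip time as the one-way latency; real MPI benchmark suites add warm-up iterations and sweep over message sizes, which are omitted here for brevity.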