Scalability of communicators and groups in MPI

Authors:
Humaira Kamal;Seyed M. Mirtaheri;Alan Wagner
Affiliations:
University of British Columbia, Vancouver, BC, Canada;University of British Columbia, Vancouver, BC, Canada;University of British Columbia, Vancouver, BC, Canada
Venue:
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Year:
2010

Citing 12
Cited 8

Graph-Based Algorithms for Boolean Function Manipulation

IEEE Transactions on Computers
Using MPI (2nd ed.): portable parallel programming with the message-passing interface

Using MPI (2nd ed.): portable parallel programming with the message-passing interface
Generalized Communicators in the Message Passing Interface

IEEE Transactions on Parallel and Distributed Systems
Capriccio: scalable threads for internet services

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Design and Evaluation of Nemesis, a Scalable, Low-Latency, Message-Passing Communication Subsystem

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Understanding the High-Performance-Computing Community: A Software Engineer's Perspective

IEEE Software
Evaluating Sparse Data Storage Techniques for MPI Groups and Communicators

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
RLH: Bitmap compression technique based on run-length and Huffman encoding

Information Systems
MPI on a Million Processors

Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A Linear-Time Burrows-Wheeler Transform Using Induced Sorting

SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
Breaking a Time-and-Space Barrier in Constructing Full-Text Indices

SIAM Journal on Computing
Issues in developing a thread-safe MPI implementation

EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface

Compact and efficient implementation of the MPI group operations

EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
Scalable memory use in MPI: a case study with MPICH2

EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Delegation-Based MPI communications for a hybrid parallel computer with many-core architecture

EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Efficient multithreaded context ID allocation in MPI

EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
An integrated runtime scheduler for MPI

EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Scalable algorithms for constructing balanced spanning trees on system-ranked process groups

EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
On compressing permutations and adaptive sorting

Theoretical Computer Science
An integrated fine-grain runtime system for MPI

Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

As the number of cores inside compute clusters continues to grow, the scalability of MPI (Message Passing Interface) is important to ensure that programs can continue to execute on an ever-increasing number of cores. One important scalability issue for MPI is the implementation of communicators and groups. Communicators and groups are an integral part of MPI and play an essential role in the design and use of libraries. It is challenging to create an MPI implementation to support communicators and groups to scale to the hundreds of thousands of processes that are possible in today's clusters. In this paper we present the design and evaluation of techniques to support the scalability of communicators and groups in MPI. We have designed and implemented a fine-grain version of MPI (FG-MPI) based on MPICH2, that allows thousands of full-fledged MPI processes inside an operating system process. Using FG-MPI we can create hundreds and thousands of MPI processes, which allowed us to implement and evaluate solutions to the scalability issues associated with communicators. We describe techniques to allow for sharing of group information inside processes, and the design of scalable operations to create the communicators. A set plus permutation framework is introduced for storing group information for communicators and a set, instead of map, representation is proposed for MPI group objects. Performance results are given for the execution of a MPI benchmark program with upwards of 100,000 processes with communicators created for various groups of different sizes and types.