Neighborhood communication paradigm to increase scalability in large-scale dynamic scientific applications

  • Authors:
  • Aleksandr Ovcharenko, Daniel Ibanez, Fabien Delalondre, Onkar Sahni, Kenneth E. Jansen, Christopher D. Carothers, Mark S. Shephard

  • Affiliations:
  • Aleksandr Ovcharenko, Daniel Ibanez, Fabien Delalondre, Onkar Sahni, Mark S. Shephard: Scientific Computation Research Center (SCOREC), Rensselaer Polytechnic Institute, 110 8th Street, Troy, NY 12180, USA
  • Kenneth E. Jansen: Department of Aerospace Engineering Sciences, University of Colorado at Boulder, Boulder, CO 80309, USA
  • Christopher D. Carothers: Department of Computer Science, Rensselaer Polytechnic Institute, 110 8th Street, Troy, NY 12180, USA

  • Venue:
  • Parallel Computing
  • Year:
  • 2012

Abstract

This paper introduces a general-purpose communication package built on top of MPI that aims to improve inter-processor communication independently of the supercomputer architecture being considered. The package is developed to support parallel applications whose communication is characterized by a large number of messages of various sizes, often small, that are focused within processor neighborhoods. In some cases, such as solvers operating on static mesh partitions, the number and size of messages are known a priori. In other cases, such as mesh adaptation, the messages evolve, varying in number and size, and include the dynamic movement of partition objects. The package supports such dynamic applications through two key attributes: (i) explicit consideration of the neighborhood communication pattern, which avoids many-to-many calls and reduces the number of collective calls to a minimum, and (ii) use of non-blocking MPI functions together with message packing, which manages message flow control and reduces the number and cost of communication calls. The test application demonstrated is parallel unstructured mesh adaptation. Results on IBM Blue Gene/P and Cray XE6 computers show that neighborhood-based communication control yields scalable performance when executing generally imbalanced mesh adaptation runs.
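
The following minimal C/MPI sketch illustrates the two attributes described above, not the package's actual API: each rank packs several small messages bound for the same neighbor into one contiguous buffer and exchanges the packed buffers with non-blocking point-to-point calls restricted to its neighborhood. The neighborhood here (left and right ranks on a 1-D ring) and the buffer sizes are assumptions chosen only for illustration.

/* Sketch: packed, non-blocking neighborhood exchange (assumed ring topology). */
#include <mpi.h>
#include <stdio.h>

#define MSGS_PER_NEIGHBOR 4   /* small messages packed into one buffer */
#define MSG_INTS          8   /* size of each small message, in ints   */

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* hypothetical neighborhood: the left and right ranks on a ring */
    int nbrs[2] = { (rank - 1 + size) % size, (rank + 1) % size };
    const int nnbrs = 2;

    int sendbuf[2][MSGS_PER_NEIGHBOR * MSG_INTS];
    int recvbuf[2][MSGS_PER_NEIGHBOR * MSG_INTS];
    MPI_Request reqs[4];

    /* pack several small messages per neighbor into one contiguous buffer */
    for (int n = 0; n < nnbrs; ++n)
        for (int m = 0; m < MSGS_PER_NEIGHBOR; ++m)
            for (int i = 0; i < MSG_INTS; ++i)
                sendbuf[n][m * MSG_INTS + i] = rank * 1000 + m;

    /* post non-blocking receives first, then sends, for all neighbors only;
       no many-to-many or collective call is involved */
    for (int n = 0; n < nnbrs; ++n)
        MPI_Irecv(recvbuf[n], MSGS_PER_NEIGHBOR * MSG_INTS, MPI_INT,
                  nbrs[n], 0, MPI_COMM_WORLD, &reqs[n]);
    for (int n = 0; n < nnbrs; ++n)
        MPI_Isend(sendbuf[n], MSGS_PER_NEIGHBOR * MSG_INTS, MPI_INT,
                  nbrs[n], 0, MPI_COMM_WORLD, &reqs[nnbrs + n]);

    /* complete all neighborhood exchanges at once */
    MPI_Waitall(2 * nnbrs, reqs, MPI_STATUSES_IGNORE);

    printf("rank %d received first packed value %d from rank %d\n",
           rank, recvbuf[0][0], nbrs[0]);

    MPI_Finalize();
    return 0;
}

In a dynamic setting such as mesh adaptation, the number and size of packed messages would not be fixed at compile time as above; the buffers would grow and shrink as partition objects migrate, which is the flow-control problem the paper's package addresses.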