A Microbenchmark Suite for Mixed-Mode OpenMP/MPI
IWOMP '09 Proceedings of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme Parallelism
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Automatic Hybrid MPI+OpenMP Code Generation with llc
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Adaptive MPI multirail tuning for non-uniform input/output access
EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
Locality and topology aware intra-node communication among multicore CPUs
EuroMPI'10 Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface
Scientific Programming - Exploring Languages for Expressing Medium to Massive On-Chip Parallelism
MPI/OpenMP hybrid parallel inference for Latent Dirichlet Allocation
Proceedings of the Third Workshop on Large Scale Data Mining: Theory and Applications
Hybrid programming model for implicit PDE simulations on multicore architectures
IWOMP'11 Proceedings of the 7th international conference on OpenMP in the Petascale era
Cache size in a cost model for heterogeneous skeletons
Proceedings of the fifth international workshop on High-level parallel programming and applications
Impact of Kernel-assisted MPI communication over scientific applications: CPMD and FFTW
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Scientific application based performance on magic cube
HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Hybrid parallel programming on SMP clusters using XPFortran and OpenMP
IWOMP'10 Proceedings of the 6th international conference on Beyond Loop Level Parallelism in OpenMP: accelerators, Tasking and more
Impact of over-decomposition on coordinated checkpoint/rollback protocol
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing - Volume 2
Solar filaments detection using parallel programming in hybrid architectures
Proceedings of the 2012 workshop on High-Performance Computing for Astronomy Date
Nonuniform memory affinity strategy in multithreaded sparse matrix computations
Proceedings of the 2012 Symposium on High Performance Computing
Hybrid parallelization of a large-scale heart model
Facing the Multicore-Challenge II
Hierarchical parallel approach in vascular network modeling: hybrid MPI+OpenMP implementation
PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
Usage of petri nets for high performance computing
Proceedings of the 1st ACM SIGPLAN workshop on Functional high-performance computing
Leveraging MPI's one-sided communication interface for shared-memory programming
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Ownership passing: efficient distributed memory programming on multi-core systems
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming
Energy estimation for MPI broadcasting algorithms in large scale HPC systems
Proceedings of the 20th European MPI Users' Group Meeting
Proceedings of the 20th European MPI Users' Group Meeting
Coordinator-master-worker model for efficient large scale network simulation
Proceedings of the 6th International ICST Conference on Simulation Tools and Techniques
INSPIRE: the insieme parallel intermediate representation
PACT '13 Proceedings of the 22nd international conference on Parallel architectures and compilation techniques
Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming
The Journal of Supercomputing
Hi-index | 0.00 |
Today most systems in high-performance computing (HPC) feature a hierarchical hardware design: Shared memory nodes with several multi-core CPUs are connected via a network infrastructure. Parallel programming must combine distributed memory parallelization on the node interconnect with shared memory parallelization inside each node. We describe potentials and challenges of the dominant programming models on hierarchically structured hardware: Pure MPI (Message Passing Interface), pure OpenMP (with distributed shared memory extensions) and hybrid MPI+OpenMP in several flavors. We pinpoint cases where a hybrid programming model can indeed be the superior solution because of reduced communication needs and memory consumption, or improved load balance. Furthermore we show that machine topology has a significant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future. Finally we give an outlook on possible standardization goals and extensions that could make hybrid programming easier to do with performance in mind.