Most HPC systems are clusters of shared-memory nodes. Parallel programming must therefore combine distributed-memory parallelization across the node interconnect with shared-memory parallelization inside each node. This paper introduces several programming models for hybrid systems. It focuses on programming methods that can achieve optimal inter-node communication bandwidth, and on the hybrid MPI+OpenMP approach and its programming rules. The communication behavior is compared with the pure MPI programming paradigm and with RDMA- and NUMA-based programming models.
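The two-level structure described above can be illustrated with a small toy model. The sketch below is not taken from the paper: it assumes a 1-D block decomposition, ranks placed consecutively on nodes, and a nearest-neighbor exchange along a chain. The function names (`block_range`, `hybrid_decomposition`, `chain_messages`) are hypothetical and chosen only for illustration. It shows how work might be split first across nodes and then across threads, and how the message pattern differs between one MPI process per core (pure MPI) and one MPI process per node (hybrid).

```python
def block_range(n, parts, index):
    """Half-open range [start, stop) of block `index` when n items
    are split into `parts` contiguous, near-equal blocks."""
    base, extra = divmod(n, parts)
    start = index * base + min(index, extra)
    stop = start + base + (1 if index < extra else 0)
    return start, stop


def hybrid_decomposition(n, nodes, threads_per_node):
    """Two-level split: first across nodes (the distributed-memory level),
    then across threads inside each node (the shared-memory level).
    Returns a dict mapping (node, thread) to a global [start, stop) range."""
    layout = {}
    for node in range(nodes):
        node_start, node_stop = block_range(n, nodes, node)
        for thread in range(threads_per_node):
            t_start, t_stop = block_range(node_stop - node_start,
                                          threads_per_node, thread)
            layout[(node, thread)] = (node_start + t_start,
                                      node_start + t_stop)
    return layout


def chain_messages(nodes, ranks_per_node):
    """Count directed nearest-neighbor messages per exchange step in a
    1-D chain of ranks. Assumes ranks are placed consecutively on nodes."""
    total_ranks = nodes * ranks_per_node
    total = 2 * (total_ranks - 1)      # each neighbor pair sends both ways
    inter_node = 2 * (nodes - 1)       # pairs that straddle a node boundary
    return {"inter_node": inter_node, "intra_node": total - inter_node}


if __name__ == "__main__":
    print(hybrid_decomposition(10, nodes=2, threads_per_node=2))
    print("pure MPI:", chain_messages(nodes=4, ranks_per_node=8))
    print("hybrid:  ", chain_messages(nodes=4, ranks_per_node=1))
```

In this toy model the number of inter-node messages is the same in both cases. The hybrid layout removes only the intra-node messages, which are replaced by direct shared-memory access. Whether the remaining inter-node traffic can saturate the interconnect is the kind of question the paper's comparison addresses, and this model does not answer it.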