This paper presents HPF extensions for clusters of SMPs and their implementation in VFC, the Vienna Fortran Compiler. The extensions aim to optimize HPF for SMP clusters by enhancing the mapping mechanisms and by giving the user high-level control over key aspects of distributed-memory and shared-memory parallelization. Based on these extensions, VFC adopts a hybrid parallelization strategy that closely reflects the hierarchical structure of SMP clusters: it exploits shared-memory parallelism with OpenMP within nodes and distributed-memory parallelism with MPI across nodes. We describe the language extensions, outline VFC's hybrid parallelization strategy, and present experimental results that show the effectiveness of these techniques.
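The two-level structure described above can be illustrated with a small, purely sequential Python sketch. It is not the HPF extension syntax and not the code VFC generates; it only models the idea of a hierarchical mapping, assuming a simple block distribution at both levels. All names (`block_bounds`, `hierarchical_map`, `hybrid_sum`) are hypothetical and chosen for illustration. An array is first split into blocks across nodes (the level MPI would handle), and each node's block is then split across the threads of that node (the level OpenMP would handle).

```python
def block_bounds(n, parts, p):
    """Half-open range [lo, hi) of block p when n items are split
    into `parts` near-equal contiguous blocks."""
    base, extra = divmod(n, parts)
    lo = p * base + min(p, extra)
    hi = lo + base + (1 if p < extra else 0)
    return lo, hi


def hierarchical_map(n, nodes, threads_per_node):
    """Map each index 0..n-1 to a (node, thread) pair.

    Level 1: block distribution across nodes (distributed memory).
    Level 2: block distribution of each node's part across its
             threads (shared memory).
    """
    owner = {}
    for node in range(nodes):
        node_lo, node_hi = block_bounds(n, nodes, node)
        local_n = node_hi - node_lo
        for thread in range(threads_per_node):
            t_lo, t_hi = block_bounds(local_n, threads_per_node, thread)
            for i in range(node_lo + t_lo, node_lo + t_hi):
                owner[i] = (node, thread)
    return owner


def hybrid_sum(data, nodes, threads_per_node):
    """Sum `data` following the two-level mapping.

    Thread partials are combined per node first (the role a
    shared-memory reduction would play), then node results are
    combined (the role a message-passing reduction would play).
    """
    owner = hierarchical_map(len(data), nodes, threads_per_node)
    partial = [[0] * threads_per_node for _ in range(nodes)]
    for i, (node, thread) in owner.items():
        partial[node][thread] += data[i]
    node_sums = [sum(row) for row in partial]  # intra-node combine
    return sum(node_sums), node_sums           # inter-node combine


if __name__ == "__main__":
    total, per_node = hybrid_sum(list(range(10)), nodes=2, threads_per_node=2)
    print(total, per_node)
```

In a real hybrid program the inner level would run as threads sharing one copy of the node's data, so only the outer level requires explicit communication. The sketch keeps both levels sequential to make the mapping itself easy to inspect.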