PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
Pthreads programming
Distributed and parallel computing
Distributed and parallel computing
Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering
Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering
High Performance Computing
MPI-The Complete Reference, Volume 1: The MPI Core
MPI-The Complete Reference, Volume 1: The MPI Core
PVMPI: An Integration of the PVM and MPI Systems
PVMPI: An Integration of the PVM and MPI Systems
Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
MPI versus MPI+OpenMP on IBM SP for the NAS benchmarks
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Performance modeling and tuning of an unstructured mesh CFD application
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Parallel programming with message passing and directives
Computing in Science and Engineering
Analyzing the Parallel Scalability of an Implicit Unstructured Mesh CFD Code
HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Dual-level parallelism for deterministic and stochastic CFD problems
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Fast sparse matrix-vector multiplication for TeraFlop/s computers
VECPAR'02 Proceedings of the 5th international conference on High performance computing for computational science
An implementation of parallel eigenvalue computation using dual-level hybrid parallelism
ICA3PP'07 Proceedings of the 7th international conference on Algorithms and architectures for parallel processing
Solving the symmetric tridiagonal eigenproblem using MPI/OpenMP hybrid parallelization
APPT'05 Proceedings of the 6th international conference on Advanced Parallel Processing Technologies
Multi-level parallelism for incompressible flow computations on GPU clusters
Parallel Computing
Hi-index | 0.00 |
The authors describe their experiences converting an existing serial production code to a parallel code combining both MPI and OpenMP. Such dual-level parallel codes will be able to take full advantage of the emerging class of high performance computer architectures using small clusters of shared-memory processors connected via a message-passing network. While the focus is restricted to a harbor response simulation code, the techniques presented herein are appropriate for a broad class of applications that explore a parameter space. The code modifications reduced the execution time of one test case from 3100 minutes on a single CPU to just over 12 minutes on 256 CPUs. Results demonstrate that dual-level parallelism allows substantial increases in model resolution combined with improvements in simulation turnaround time but, contrary to conventional wisdom, requires very little source code alteration.