This paper presents several algorithmic innovations and a hybrid programming style that lead to highly scalable shared-memory performance for a new computational fluid dynamics flow solver. This hybrid model is then converted to a strict message-passing implementation, and performance results for the two are compared. Results show that, with this hybrid approach, our OpenMP implementation is actually marginally faster than the MPI version, with parallel speedups of up to 599 out of 640 using OpenMP and 486 with MPI.
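To put the reported speedups on a common scale, they can be expressed as parallel efficiency, that is, speedup divided by the ideal speedup. The short sketch below is only an illustration and is not taken from the paper: it assumes that "out of 640" denotes the ideal (linear) speedup, and the function name parallel_efficiency is hypothetical.

```python
def parallel_efficiency(speedup: float, ideal: float) -> float:
    """Return the achieved speedup as a fraction of the ideal speedup."""
    return speedup / ideal


# Assumption: "out of 640" in the abstract is the ideal (linear) speedup.
IDEAL_SPEEDUP = 640

openmp_eff = parallel_efficiency(599, IDEAL_SPEEDUP)  # reported OpenMP speedup
mpi_eff = parallel_efficiency(486, IDEAL_SPEEDUP)     # reported MPI speedup

print(f"OpenMP efficiency: {openmp_eff:.1%}")
print(f"MPI efficiency:    {mpi_eff:.1%}")
```

Under that assumption, the OpenMP figure corresponds to roughly 94% efficiency and the MPI figure to roughly 76%.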