Iterative Methods for Sparse Linear Systems
Iterative Methods for Sparse Linear Systems
An Analysis of Performance Enhancement Techniques for Overset Grid Applications
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Task Assignment Heuristics for Distributed CFD Applications
ICPPW '01 Proceedings of the 2001 International Conference on Parallel Processing Workshops
Parallel solution of unsteady incompressible viscous flows using multiblock structured grids
Parallel solution of unsteady incompressible viscous flows using multiblock structured grids
Graph partitioning for high-performance scientific simulations
Sourcebook of parallel computing
Evaluation of Cache-based Superscalar and Cacheless Vector Architectures for Scientific Computations
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Porting CFD codes towards grids: a case study
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
Hi-index | 0.00 |
The overset grid methodology has significantly reduced time-to-solution of high-fidelity computational fluid dynamics (CFD) simulations about complex aerospace configurations. The solution process resolves the geometrical complexity of the problem domain by using separately generated but overlapping structured discretization grids that periodically exchange information through interpolation. However, high performance computations of such large-scale realistic applications must be handled efficiently on state-of-the-art parallel supercomputers. This paper analyzes the effects of various performance enhancement strategies on the parallel efficiency of an overset grid Navier-Stokes CFD application running on an SGI Origin2000 machine. Specifically, the role of asynchronous communication, grid splitting, and grid grouping strategies are presented and discussed. Details of a sophisticated graph partitioning technique for grid grouping are also provided. Results indicate that performance depends critically on the level of latency hiding and the quality of load balancing across the processors.