Matrix computations (3rd ed.)
LAPACK Working Note 56: Reducing Communication Costs in the Conjugate Gradient Algorithm on Distributed Memory Multiprocessors
Message-Passing Performance of Various Computers
Message-Passing Performance of Various Computers
Parallel Performance in Multi-physics Simulation
ICCS '02 Proceedings of the International Conference on Computational Science-Part II
Multi-component free surface flows and rotating devices in the context of minerals processing
International Journal of Computational Fluid Dynamics - Mining and Mineral Processing
Multi-component free surface flows and rotating devices in the context of minerals processing
International Journal of Computational Fluid Dynamics - CFD Methods of Mining and Mineral Processing
Two-constraint domain decomposition with Space Filling Curves
Parallel Computing
Hi-index | 0.00 |
Realizing scalable performance on high performance computing systems is not straightforward for single-phenomenon codes (such as computational fluid dynamics [CFD]). This task is magnified considerably when the target software involves the interactions of a range of phenomena that have distinctive solution procedures involving different discretization methods. The problems of addressing the key issues of retaining data integrity and the ordering of the calculation procedures are significant. A strategy for parallelizing this multiphysics family of codes is described for software exploiting finite-volume discretization methods on unstructured meshes using iterative solution procedures. A mesh partitioning-based SPMD approach is used. However, since different variables use distinct discretization schemes, this means that distinct partitions are required; techniques for addressing this issue are described using the mesh-partitioning tool, JOSTLE. In this contribution, the strategy is tested for a variety of test cases under a wide range of conditions (e.g., problem size, number of processors, asynchronous/synchronous communications, etc.) using a variety of strategies for mapping the mesh partition onto the processor topology.