A unified geometric approach to graph separators
SFCS '91 Proceedings of the 32nd annual symposium on Foundations of computer science
Towards a fast implementation of spectral nested dissection
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Rectilinear partitioning of irregular data parallel computations
Journal of Parallel and Distributed Computing
A parallel algorithm for multilevel graph partitioning and sparse matrix ordering
Journal of Parallel and Distributed Computing
A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs
SIAM Journal on Scientific Computing
Numerical Linear Algebra for High Performance Computers
Numerical Linear Algebra for High Performance Computers
Non-uniform 2-D grid partitioning for heterogeneous parallel architectures
IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
Graph Partitioning for Dynamic, Adaptive and Multi-phase Scientific Simulations
CLUSTER '01 Proceedings of the 3rd IEEE International Conference on Cluster Computing
Graph partitioning for high-performance scientific simulations
Sourcebook of parallel computing
Hi-index | 0.00 |
The cost/performance ratio of networks of workstations has been constantly improving. This trend is expected to continue in the near future. The aggregate peak rate of such systems often matches or exceeds the peak rate offered by the fastest parallel computers. This has motivated research toward using a network of computers, interconnected via a fast network (cluster system) or a simple Local Area Network (LAN) (distributed system), for high performance concurrent computations. Some of the important research issues arise such as (i) Problem partitioning and virtual interconnection topology mapping; (ii) Execution scheduling and load balancing.Past results exist for grid partitioning (into subdomains) and mapping to parallel and distributed systems. In our work we consider the problem of grid partitioning of a 3D domain arising in aircraft CFD simulations in order to schedule tasks for load balanced execution on a heterogeneous distributed system. This problem has additional restrictions on how to partition the grid. Past work for this problem were on parallel systems with only few processor configurations. We derive heuristic algorithms for: (1) homogeneous systems with any number of processors; (2) heterogeneous systems taking into account the processor speed and memory capacity. We implement our algorithms on a dedicated network of workstations (using MPI) and test them with a CFD simulation code (TURNS--Transonic Unsteady Rotor Navier Stokes).