The Gradient Model Load Balancing Method
IEEE Transactions on Software Engineering - Special issue on distributed systems
Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes
IEEE Transactions on Computers
A parallel graph partitioning algorithm for a message-passing multiprocessor
International Journal of Parallel Programming
Dynamic load balancing for distributed memory multiprocessors
Journal of Parallel and Distributed Computing
Task allocation onto a hypercube by recursive mincut bipartitioning
Journal of Parallel and Distributed Computing
Solving problems on concurrent processors: vol. 2
Solving problems on concurrent processors: vol. 2
Analysis of a graph coloring based distributed load balancing algorithm
Journal of Parallel and Distributed Computing
SIAM Journal on Scientific and Statistical Computing
Performance of dynamic load balancing algorithms for unstructured mesh calculations
Concurrency: Practice and Experience
Rectilinear partitioning of irregular data parallel computations
Journal of Parallel and Distributed Computing
An improved spectral graph partitioning algorithm for mapping parallel computations
SIAM Journal on Scientific Computing
The generalized dimension exchange method for load balancing in k-ary n-cubes and variants
Journal of Parallel and Distributed Computing
A multilevel algorithm for partitioning graphs
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Dynamic Partitioning of Non-Uniform Structured Workloads with Spacefilling Curves
IEEE Transactions on Parallel and Distributed Systems
Fast and parallel mapping algorithms for irregular problems
The Journal of Supercomputing
On Runtime Parallel Scheduling for Processor Load Balancing
IEEE Transactions on Parallel and Distributed Systems
Parallel Incremental Graph Partitioning
IEEE Transactions on Parallel and Distributed Systems
Computers and Intractability: A Guide to the Theory of NP-Completeness
Computers and Intractability: A Guide to the Theory of NP-Completeness
Programming a Hypercube Multicomputer
IEEE Software
Strategies for Dynamic Load Balancing on Highly Parallel Computers
IEEE Transactions on Parallel and Distributed Systems
Runtime Incremental Parallel Scheduling (RIPS) on Distributed Memory Computers
IEEE Transactions on Parallel and Distributed Systems
HPCN Europe 1996 Proceedings of the International Conference and Exhibition on High-Performance Computing and Networking
Geometric mesh partitioning: implementation and experiments
IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
A linear-time heuristic for improving network partitions
DAC '82 Proceedings of the 19th Design Automation Conference
The Direct Dimension Exchange Method for Load Balancing in k-ary n-cubes
SPDP '96 Proceedings of the 8th IEEE Symposium on Parallel and Distributed Processing (SPDP '96)
Graph Theory With Applications
Graph Theory With Applications
A Web-Based Finite Element Meshes Partitioner and Load Balancer
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Hi-index | 0.00 |
In this paper, we propose a prefix code matching parallel load-balancing method (PCMPLB) to efficiently deal with the load imbalance of solution-adaptive finite element application programs on distributed memory multicomputers. The main idea of the PCMPLB method is first to construct a prefix code tree for processors. Based on the prefix code tree, a schedule for performing load transfer among processors can be determined by concurrently and recursively dividing the tree into two subtrees and finding a maximum matching for processors in the two subtrees until the leaves of the prefix code tree are reached. We have implemented the PCMPLB method on an SP2 parallel machine and compared its performance with two load-balancing methods, the directed diffusion method and the multilevel diffusion method, and five mapping methods, the AE/ORB method, the AE/MC method, the MLkP method, the PARTY library method, and the JOSTLE-MS method. An unstructured finite element graph Truss was used as a test sample. During the execution, Truss was refined five times. Three criteria, the execution time of mapping/load-balancing methods, the execution time of an application program under different mapping/load-balancing methods, and the speedups achieved by mapping/load-balancing methods for an application program, are used for the performance evaluation. The experimental results show that (1) if a mapping method is used for the initial partitioning and this mapping method or a load-balancing method is used in each refinement, the execution time of an application program under a load-balancing method is less than that of the mapping method. (2) The execution time of an application program under the PCMPLB method is less than that of the directed diffusion method and the multilevel diffusion method.