The Gradient Model Load Balancing Method
IEEE Transactions on Software Engineering - Special issue on distributed systems
Epidemic algorithms for replicated database maintenance
PODC '87 Proceedings of the sixth annual ACM Symposium on Principles of distributed computing
The token distribution problem
SIAM Journal on Computing
Dynamic load balancing for distributed memory multiprocessors
Journal of Parallel and Distributed Computing
Load balancing and Poisson equation in a graph
Concurrency: Practice and Experience
A semi distributed task allocation strategy for large hypercube supercomputers
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
And-parallel execution of logic programs on a shared-Memory multiprocessor
Journal of Logic Programming
Cilk: an efficient multithreaded runtime system
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Multilevel k-way partitioning scheme for irregular graphs
Journal of Parallel and Distributed Computing
ACM Transactions on Computer Systems (TOCS)
Run-time and compile-time support for adaptive irregular problems
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Diffusive Load-Balancing Policies for Dynamic Applications
IEEE Concurrency
Strategies for Dynamic Load Balancing on Highly Parallel Computers
IEEE Transactions on Parallel and Distributed Systems
Optimal Load Balancing in a Multiple Processor System with Many Job Classes
IEEE Transactions on Software Engineering
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
An Adaptive Framework for Large-Scale State Space Search
IPDPSW '11 Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Periodic hierarchical load balancing for large supercomputers
International Journal of High Performance Computing Applications
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Work stealing and persistence-based load balancers for iterative overdecomposed applications
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Automated Load Balancing Invocation Based on Application Characteristics
CLUSTER '12 Proceedings of the 2012 IEEE International Conference on Cluster Computing
Scalable Algorithms for Distributed-Memory Adaptive Mesh Refinement
SBAC-PAD '12 Proceedings of the 2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing
Hi-index | 0.00 |
For many applications, computation load varies over time. Such applications require dynamic load balancing to improve performance. Centralized load balancing schemes, which perform the load balancing decisions at a central location, are not scalable. In contrast, fully distributed strategies are scalable but typically do not produce a balanced work distribution as they tend to consider only local information. This paper describes a fully distributed algorithm for load balancing that uses partial information about the global state of the system to perform load balancing. This algorithm, referred to as GrapevineLB, consists of two stages: global information propagation using a lightweight algorithm inspired by epidemic [21] algorithms, and work unit transfer using a randomized algorithm. We provide analysis of the algorithm along with detailed simulation and performance comparison with other load balancing strategies. We demonstrate the effectiveness of GrapevineLB for adaptive mesh refinement and molecular dynamics on up to 131,072 cores of BlueGene/Q.