Optimal static load balancing in distributed computer systems
Journal of the ACM (JACM)
Adaptive Optimal Load Balancing in a Nonhomogeneous Multiserver System with a Central Job Scheduler
IEEE Transactions on Computers
Optimal load balancing and scheduling in a distributed computer system
Journal of the ACM (JACM)
The Probability of Load Balancing Success in a Homogeneous Network
IEEE Transactions on Software Engineering
Optimal load balancing in distributed computer systems
Optimal load balancing in distributed computer systems
Scheduling and Load Balancing in Parallel and Distributed Systems
Scheduling and Load Balancing in Parallel and Distributed Systems
Optimizing Static Job Scheduling in a Network of Heterogeneous Computers
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Theory, Volume 1, Queueing Systems
Theory, Volume 1, Queueing Systems
Allocating Non-Real-Time and Soft Real-Time Jobs in Multiclusters
IEEE Transactions on Parallel and Distributed Systems
Optimal load distribution in nondedicated heterogeneous cluster and grid computing environments
Journal of Systems Architecture: the EUROMICRO Journal
Minimizing the probability of load imbalance in heterogeneous distributed computer systems
Mathematical and Computer Modelling: An International Journal
Energy-Aware Scheduling on Multicore Heterogeneous Grid Computing Systems
Journal of Grid Computing
Hi-index | 0.00 |
Given a group of heterogeneous blade servers in a cloud computing environment or a data center of a cloud computing provider, each having its own size and speed and its own amount of preloaded special tasks, we are facing the problem of optimal distribution of generic tasks over these blade servers, such that the average response time of generic tasks is minimized. Such performance optimization is important for a cloud computing provider to efficiently utilize all the available resources and to deliver the highest quality of service. We develop a queueing model for a group of heterogeneous blade servers, and formulate and solve the optimal load distribution problem of generic tasks for multiple heterogeneous blade servers in a cloud computing environment in two different situations, namely, special tasks with and without higher priority. Extensive numerical examples and data are demonstrated and some important observations are made. It is found that server sizes, server speeds, task execution requirement, and the arrival rates of special tasks all have significant impact on the average response time of generic tasks, especially when the total arrival rate of generic tasks is large. It is also found that the server size heterogeneity and the server speed heterogeneity do not have much impact on the average response time of generic tasks. Furthermore, larger (smaller, respectively) heterogeneity results in shorter (longer, respectively) average response time of generic tasks.