Today's web services are commonly hosted on clusters of servers, often located within computing clouds whose computational and storage resources can be highly heterogeneous. The workload served typically exhibits disparate computation patterns (e.g., CPU-intensive or IO-intensive) that fluctuate in both volume and mix. System heterogeneity, together with workload diversity, exacerbates the challenge of distributing load effectively within a computing cloud. This paper presents a novel, mix-aware load-balancing algorithm that aims to distribute requests sent by multiple applications across heterogeneous servers so that application response times are minimized and system resources (e.g., CPU and IO) are equally utilized. To this end, the presented algorithm tries not only to balance the total number of requests seen by each server, but also to shape the requests received by each server into a certain "mix" that is analytically shown to be optimal for response time minimization. Our experimental results, based on both simulation and a prototype implementation, show that the mix-aware algorithm achieves robust performance in most workload mixes, as well as a consistent performance improvement over one of the most robust load-balancing schemes of the Apache server.
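The idea of balancing both request count and request mix can be illustrated with a small sketch. This is not the paper's algorithm: the abstract does not give the optimal mix or the dispatching rule, so the greedy cost function, the two request kinds ("cpu" and "io"), and the names `Server`, `target_cpu_share`, `capacity`, `MIX_WEIGHT`, and `dispatch` are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field

# Hypothetical weight trading off load balance against mix deviation.
MIX_WEIGHT = 2.0


@dataclass
class Server:
    name: str
    # Assumed per-server target fraction of CPU-intensive requests;
    # the paper derives its optimal mix analytically, which is not reproduced here.
    target_cpu_share: float = 0.5
    # Assumed relative speed, to model heterogeneous servers.
    capacity: float = 1.0
    counts: dict = field(default_factory=lambda: {"cpu": 0, "io": 0})

    def cost_if_assigned(self, kind: str) -> float:
        """Capacity-normalized load plus weighted distance from the target mix,
        evaluated as if one more request of `kind` were placed here."""
        cpu = self.counts["cpu"] + (1 if kind == "cpu" else 0)
        total = self.counts["cpu"] + self.counts["io"] + 1
        mix_deviation = abs(cpu / total - self.target_cpu_share)
        return total / self.capacity + MIX_WEIGHT * mix_deviation


def dispatch(servers: list, kind: str) -> Server:
    """Greedily send the request to the server with the lowest cost
    (ties go to the earliest server in the list)."""
    chosen = min(servers, key=lambda s: s.cost_if_assigned(kind))
    chosen.counts[kind] += 1
    return chosen


# Usage: two identical servers that already hold one request each,
# but of different kinds. A pure request-count balancer sees a tie;
# the mix term steers the next CPU-intensive request to the IO-heavy server.
a = Server("a", counts={"cpu": 1, "io": 0})
b = Server("b", counts={"cpu": 0, "io": 1})
picked = dispatch([a, b], "cpu")
```

In this sketch, request completions are not modeled; a fuller version would decrement `counts` when a request finishes so that the cost reflects requests currently in service.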