On load balancing: a mix-aware algorithm for heterogeneous systems

Authors:
Sebastiano Spicuglia;Mathias Bjöerkqvist;Lydia Y. Chen;Giuseppe Serazzi;Walter Binder;Evgenia Smirni
Affiliations:
University of Lugano, Lugano, Switzerland;IBM Research Zurich Lab, Zurich, Switzerland;IBM Research Zurich Lab, Zurich, Switzerland;Politecnico Milano, Milano, Italy;University of Lugano, Lugano, Switzerland;College of William and Mary, Williamsburg, Virginia, USA
Venue:
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Year:
2013

Citing 13
Cited 0

Dynamic Load Balancing on Web-Server Systems

IEEE Internet Computing
Queueing Network Models with Two Classes of Customers

MASCOTS '97 Proceedings of the 5th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems
Optimizing a 'Content-Aware' Load Balancing Strategy for Shared Web Hosting Service

MASCOTS '00 Proceedings of the 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems
ADAPTLOAD: Effective Balancing in Custered Web Servers Under Transient Load Conditions

ICDCS '02 Proceedings of the 22 nd International Conference on Distributed Computing Systems (ICDCS'02)
Workload-Aware Load Balancing for Clustered Web Servers

IEEE Transactions on Parallel and Distributed Systems
Insensitivity for PS server farms with JSQ routing

ACM SIGMETRICS Performance Evaluation Review
Autonomic mix-aware provisioning for non-stationary data center workloads

Proceedings of the 7th international conference on Autonomic computing
Load-Balancing Dynamic Service Binding in Composition Execution Engines

APSCC '10 Proceedings of the 2010 IEEE Asia-Pacific Services Computing Conference
Resource provisioning of web applications in heterogeneous clouds

WebApps'11 Proceedings of the 2nd USENIX conference on Web application development
Achieving application-centric performance targets via consolidation on multicores: myth or reality?

Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Dynamic Replication in Service-Oriented Systems

CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Opportunistic Service Provisioning in the Cloud

CLOUD '12 Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing
Model-driven consolidation of Java workloads on multicores

DSN '12 Proceedings of the 2012 42nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Today's web services are commonly hosted on clusters of servers that are often located within computing clouds, whose computational and storage resources can be highly heterogeneous. The workload served typically exhibits disparate computation patterns (e.g., CPU-intensive or IO-intensive), that fluctuate both in terms of volume and mix. The system heterogeneity together with workload diversity further exacerbates the challenge of effective distribution of load within a computing cloud. This paper presents a novel, mix-aware load-balancing algorithm, which aims to distribute requests sent by multiple applications in heterogeneous servers such that the application response times are minimized and system resources (e.g., CPU and IO) are equally utilized. To this end, the presented algorithm tries to not only balance the total number of requests seen by each server, but also to shape the requests received by each server into a certain "mix", that is analytically shown to be optimal for response time minimization. Our experimental results---based both on simulation and on a prototype implementation---show that the mix-aware algorithm achieves robust performance in most workload mixes as well as a consistent performance improvement in comparison with one of the most robust load-balancing schemes of the Apache server.