Optimal allocation of servers and processing time in a load balancing system

Authors:
Bacel Maddah;Muhammad El-Taha;Roy Abou Tayeh
Affiliations:
Engineering Management Program, American University of Beirut, Beirut 1107 2020, Lebanon;Department of Mathematics and Statistics, University of Southern Maine, 96 Falmouth Street, Portland, ME 04104-9300, USA;Engineering Management Program, American University of Beirut, Beirut 1107 2020, Lebanon
Venue:
Computers and Operations Research
Year:
2010

Citing 16
Cited 0

Commonalities in reengineered business processes: models and issues

Management Science
Exploiting process lifetime distributions for dynamic load balancing

ACM Transactions on Computer Systems (TOCS)
Self-similarity in World Wide Web traffic: evidence and possible causes

IEEE/ACM Transactions on Networking (TON)
Measurement-based modelling of Internet dial-up access connections

Computer Networks: The International Journal of Computer and Telecommunications Networking
Partitioning Customers Into Service Groups

Management Science
On choosing a task assignment policy for a distributed server system

Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Task assignment with unknown duration

Journal of the ACM (JACM)
EQUILOAD: a load balancing policy for clustered web servers

Performance Evaluation
Allocation of service time in a two-server system

Computers and Operations Research
Analysis of Task Assignment with Cycle Stealing under Central Queue

ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
New directions in machine scheduling

New directions in machine scheduling
Optimal state-free, size-aware dispatching for heterogeneous M/G/-type systems

Performance Evaluation - Performance 2005
Task assignment with work-conserving migration

Parallel Computing
Allocation of Service Time in a Multiserver System

Management Science
Analysis of size interval task assignment policies

ACM SIGMETRICS Performance Evaluation Review
Surprising results on task assignment in server farms with high-variability workloads

Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems

Quantified Score

Hi-index	0.01

Visualization

Abstract

We consider the problem of allocating processing time in a multi-channel load balancing system by focusing on systems where processing times have distributions characterized by high variability. Our objective is to reduce congestion by routing jobs to servers based on their workload. Specifically, we arrange servers in two stations in series, and require that the load be balanced between the two stations. All arrivals join the first service center where they receive a maximum of T units of service. Arrivals with service requirements that exceed the value T join the second station where they receive their remaining service. For a variety of heavy tail service time distributions, characterized by high variability, analytical and numerical comparisons show that our scheme provides better system performance than the standard parallel multi-server model in the sense of reducing the mean delay per customer when the traffic intensity is not too low. In particular, we develop lower bounds on the traffic intensity and the service time coefficient of variation beyond which the balanced series system outperforms the parallel system.