Commonalities in reengineered business processes: models and issues
Management Science
Exploiting process lifetime distributions for dynamic load balancing
ACM Transactions on Computer Systems (TOCS)
Self-similarity in World Wide Web traffic: evidence and possible causes
IEEE/ACM Transactions on Networking (TON)
Measurement-based modelling of Internet dial-up access connections
Computer Networks: The International Journal of Computer and Telecommunications Networking
Partitioning Customers Into Service Groups
Management Science
On choosing a task assignment policy for a distributed server system
Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Task assignment with unknown duration
Journal of the ACM (JACM)
EQUILOAD: a load balancing policy for clustered web servers
Performance Evaluation
Allocation of service time in a two-server system
Computers and Operations Research
Analysis of Task Assignment with Cycle Stealing under Central Queue
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
New directions in machine scheduling
New directions in machine scheduling
Optimal state-free, size-aware dispatching for heterogeneous M/G/-type systems
Performance Evaluation - Performance 2005
Task assignment with work-conserving migration
Parallel Computing
Allocation of Service Time in a Multiserver System
Management Science
Analysis of size interval task assignment policies
ACM SIGMETRICS Performance Evaluation Review
Surprising results on task assignment in server farms with high-variability workloads
Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems
Hi-index | 0.01 |
We consider the problem of allocating processing time in a multi-channel load balancing system by focusing on systems where processing times have distributions characterized by high variability. Our objective is to reduce congestion by routing jobs to servers based on their workload. Specifically, we arrange servers in two stations in series, and require that the load be balanced between the two stations. All arrivals join the first service center where they receive a maximum of T units of service. Arrivals with service requirements that exceed the value T join the second station where they receive their remaining service. For a variety of heavy tail service time distributions, characterized by high variability, analytical and numerical comparisons show that our scheme provides better system performance than the standard parallel multi-server model in the sense of reducing the mean delay per customer when the traffic intensity is not too low. In particular, we develop lower bounds on the traffic intensity and the service time coefficient of variation beyond which the balanced series system outperforms the parallel system.