Deciding which queue to join: Some counterexamples
Operations Research
An approximation to the response time for shortest queue routing
SIGMETRICS '89 Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Wide area traffic: the failure of Poisson modeling
IEEE/ACM Transactions on Networking (TON)
Exploiting process lifetime distributions for dynamic load balancing
ACM Transactions on Computer Systems (TOCS)
Self-similarity in World Wide Web traffic: evidence and possible causes
IEEE/ACM Transactions on Networking (TON)
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Heavy-tailed probability distributions in the World Wide Web
A practical guide to heavy tails
Load-sensitive routing of long-lived IP flows
Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Load-balancing heuristics and process behavior
SIGMETRICS '86/PERFORMANCE '86 Proceedings of the 1986 ACM SIGMETRICS joint international conference on Computer performance modelling, measurement and evaluation
On choosing a task assignment policy for a distributed server system
Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Implementing Multiprocessor Scheduling Disciplines
IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Theory and Practice in Parallel Job Scheduling
IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Improved Utilization and Responsiveness with Gang Scheduling
IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
A parallel workload model and its implications for processor allocation
HPDC '97 Proceedings of the 6th IEEE International Symposium on High Performance Distributed Computing
HPDC '00 Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing
Load profiling: a methodology for scheduling real-time tasks in a distributed system
ICDCS '97 Proceedings of the 17th International Conference on Distributed Computing Systems (ICDCS '97)
Cycle stealing under immediate dispatch task assignment
Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures
Analysis of Task Assignment with Cycle Stealing under Central Queue
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
Class-Dependent Assignment in cluster-based servers
Proceedings of the 2004 ACM symposium on Applied computing
Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Design and implementation of a feedback controller for slowdown differentiation on internet servers
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Robust Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers
IEEE Transactions on Computers
On the Distribution of Sequential Jobs in Random Brokering for Heterogeneous Computational Grids
IEEE Transactions on Parallel and Distributed Systems
Deferred Assignment Scheduling in Cluster-Based Servers
Cluster Computing
Systems with multiple servers under heavy-tailed workloads
Performance Evaluation - Performance 2005
A simulation-based approach for dynamic process management at web service platforms
Computers and Industrial Engineering
Resource Allocation for Session-Based Two-Dimensional Service Differentiation on e-Commerce Servers
IEEE Transactions on Parallel and Distributed Systems
Task assignment with work-conserving migration
Parallel Computing
Allocation of Service Time in a Multiserver System
Management Science
Analysis of join-the-shortest-queue routing for web server farms
Performance Evaluation
Insensitivity for PS server farms with JSQ routing
ACM SIGMETRICS Performance Evaluation Review
A content-based load balancing algorithm with admission control for cluster web servers
Future Generation Computer Systems
Comparing job allocation schemes where service demand is unknown
Journal of Computer and System Sciences
Resource allocation optimization for quantitative service differentiation on server clusters
Journal of Parallel and Distributed Computing
Controlling the Behaviour of Database Servers with 2PAC and DiffServ
DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
Analysis of size interval task assignment policies
ACM SIGMETRICS Performance Evaluation Review
Surprising results on task assignment in server farms with high-variability workloads
Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems
A simulation-based approach for dynamic process management at web service platforms
Computers and Industrial Engineering
Performance Evaluation
Task assignment on parallel QoS systems
WISE'07 Proceedings of the 8th international conference on Web information systems engineering
Why segregating short jobs from long jobs under high variability is not always a win
Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
Optimal allocation of servers and processing time in a load balancing system
Computers and Operations Research
Load balancing via random local search in closed and open systems
Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systems
To balance or unbalance load in size-interval task allocation
Probability in the Engineering and Informational Sciences
Improving MapReduce performance in heterogeneous environments
OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
A predictive and probabilistic load-balancing algorithm for cluster-based web servers
Applied Soft Computing
The GHS grid scheduling system: implementation and performance comparison
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Modelling job allocation where service duration is unknown
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Proposal of an analytical solution for the load imbalance problem in parallel systems
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
Queueing Systems: Theory and Applications
A case for on-machine load balancing
Journal of Parallel and Distributed Computing
Implementation of round robin policy in DNS for thresholding of distributed web server system
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Review: Task assignment policies in distributed server systems: A survey
Journal of Network and Computer Applications
The price of forgetting in parallel and non-observable queues
Performance Evaluation
Quantitative service differentiation: a square-root proportional model
EUC'06 Proceedings of the 2006 international conference on Embedded and Ubiquitous Computing
Regression-based resource provisioning for session slowdown guarantee in multi-tier Internet servers
Journal of Parallel and Distributed Computing
Multi-core code in a cluster – a meaningful option?
GPC'10 Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing
Task assignment based on prioritising traffic flows
OPODIS'04 Proceedings of the 8th international conference on Principles of Distributed Systems
Mitigating the negative impact of preemption on heterogeneous MapReduce workloads
Proceedings of the 7th International Conference on Network and Services Management
On the nature and impact of self-similarity in real-time systems
Real-Time Systems
Minimizing slowdown in heterogeneous size-aware dispatching systems
Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
Test-cost-sensitive attribute reduction
Information Sciences: an International Journal
Load balancing via random local search in closed and open systems
Queueing Systems: Theory and Applications
Analysis of SITA queues with many servers and spacetime geometry
ACM SIGMETRICS Performance Evaluation Review
Double auction-inspired meta-scheduling of parallel applications on global grids
Journal of Parallel and Distributed Computing
Distributed oblivious load balancing using prioritized job replication
Proceedings of the 8th International Conference on Network and Service Management
Hi-index | 0.01 |
We consider a distributed server system and ask which policy should be used for assigning jobs (tasks) to hosts. In our server, jobs are not preemptible. Also, the job's service demand is not known a priori. We are particularly concerned with the case where the workload is heavy-tailed, as is characteristic of many empirically measured computer workloads. We analyze several natural task assignment policies and propose a new one TAGS (Task Assignment based on Guessing Size). The TAGS algorithm is counterintuitive in many respects, including load unbalancing, non-work-conserving, and fairness. We find that under heavy-tailed workloads, TAGS can outperform all task assignment policies known to us by several orders of magnitude with respect to both mean response time and mean slowdown, provided the system load is not too high. We also introduce a new practical performance metric for distributed servers called server expansion. Under the server expansion metric, TAGS significantly outperforms all other task assignment policies, regardless of system load.