A least flow-time first load sharing approach for distributed server farm

Authors:
Zahir Tari;James Broberg;Albert Y. Zomaya;Roberto Baldoni
Affiliations:
RMIT University, Australia;RMIT University, Australia;University of Sydney, Australia;University of Rome, Italy
Venue:
Journal of Parallel and Distributed Computing
Year:
2005

Citing 12
Cited 6

Utopia: a load sharing facility for large, heterogeneous distributed computer systems

Software—Practice & Experience
Self-similarity in World Wide Web traffic: evidence and possible causes

Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Task assignment in a distributed system (extended abstract): improving performance by unbalancing load

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Heavy-tailed probability distributions in the World Wide Web

A practical guide to heavy tails
Minimizing the flow time without migration

STOC '99 Proceedings of the thirty-first annual ACM symposium on Theory of computing
Hydrodynamic Load Balancing

IEEE Transactions on Parallel and Distributed Systems
On choosing a task assignment policy for a distributed server system

Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Methodical Analysis of Adaptive Load Sharing Algorithms

IEEE Transactions on Parallel and Distributed Systems
Design and Evaluation of Effective Load Sharing in Distributed Real-Time Systems

IEEE Transactions on Parallel and Distributed Systems
Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness

HPDC '00 Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing
Task Assignment with Unknown Duration

ICDCS '00 Proceedings of the The 20th International Conference on Distributed Computing Systems ( ICDCS 2000)
On the relationship between file sizes, transport protocols, and self-similar network traffic

ICNP '96 Proceedings of the 1996 International Conference on Network Protocols (ICNP '96)

Surprising results on task assignment in server farms with high-variability workloads

Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems
Why segregating short jobs from long jobs under high variability is not always a win

Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
To balance or unbalance load in size-interval task allocation

Probability in the Engineering and Informational Sciences
The impact of quanta on the performance of multi-level time sharing policy under heavy-tailed workloads

ACSC '09 Proceedings of the Thirty-Second Australasian Conference on Computer Science - Volume 91
Review: Task assignment policies in distributed server systems: A survey

Journal of Network and Computer Applications
An enhanced grid scheduling with job priority and equitable interval job distribution

GPC'06 Proceedings of the First international conference on Advances in Grid and Pervasive Computing

Quantified Score

Hi-index	0.02

Visualization

Abstract

The most critical property exhibited by a heavy-tailed workload distribution (found in many WWW workloads) is that a very small fraction of tasks make up a large fraction of the workload, making the load very difficult to distribute in a distributed system. Load balancing and load sharing are the two predominant load distribution strategies used in such systems. Load sharing generally has better response time than load balancing because the latter can exhibit excessive overheads in selecting servers and partitioning tasks. We therefore further explored the least-loaded-first (LLF) load sharing approach and found two important limitations: (a) LLF does not consider the order of processing, and (b) when it assigns a task, LLF does not consider the processing capacity of servers. The high task size variation that exists in heavy-tailed workloads often causes smaller tasks to be severely delayed by large tasks. This paper proposes a size-based approach, called the least flow-time first (LFF-SIZE), which reduces the delay caused by size variation while maintaining a balanced load in the system. LFF-SIZE takes the relative processing time of a task into account and dynamically assigns a task to the fittest server with a lighter load and higher processing capacity. LFF-SIZE also uses a multi-section queue to separate larger tasks from smaller ones. This arrangement effectively reduces the delay of smaller tasks by larger ones as small tasks are given a higher priority to be processed. The performance results performed on the LFF-SIZE implementation shows a substantial improvement over existing load sharing and static size-based approaches under realistic heavy-tailed workloads.