We analyze the performance of Size Interval Task Assignment (SITA) policies for multi-host assignment in a non-preemptive environment. Assuming Poisson arrivals, we provide general bounds on the average waiting time that hold independently of the job size distribution. We establish a general duality theory for the performance of SITA policies. We provide a detailed analysis of SITA systems when the job size distribution is Bounded Pareto and the range of job sizes tends to infinity. In particular, we determine asymptotically optimal cutoff values and provide asymptotic formulas for the average waiting time and slowdown. We compare the results with the Least Work Remaining policy and determine which policy is asymptotically better for any given set of parameters. In the case of inhomogeneous hosts, we determine their optimal ordering.
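As a rough illustration of the setting described above, here is a minimal Python sketch of how the average waiting time of a SITA system can be evaluated numerically. It is not the paper's asymptotic analysis. It assumes homogeneous unit-speed hosts, FIFO service at each host, and a Bounded Pareto density on [k, p] with shape alpha, and it applies the standard Pollaczek-Khinchine formula for M/G/1 queues to each host. The function names (`partial_moment`, `sita_mean_wait`, `best_cutoff`) and the brute-force grid search are illustrative choices, not taken from the source.

```python
import math


def partial_moment(j, a, b, k, p, alpha):
    """Integral of x**j * f(x) over [a, b], where f is the
    Bounded Pareto density on [k, p] with shape alpha."""
    norm = alpha * k**alpha / (1.0 - (k / p) ** alpha)
    e = j - alpha
    if abs(e) < 1e-12:
        # Special case j == alpha: the integrand is proportional to 1/x.
        return norm * math.log(b / a)
    return norm * (b**e - a**e) / e


def sita_mean_wait(lam, cutoffs, k, p, alpha):
    """Mean waiting time under SITA with Poisson arrivals of rate lam.

    Host i serves the jobs whose size lies between consecutive edges of
    [k, *cutoffs, p]. Splitting a Poisson stream by job size yields
    independent Poisson streams, so each host is treated as an M/G/1 queue.
    Returns math.inf if any host is overloaded.
    """
    edges = [k, *cutoffs, p]
    total = 0.0
    for a, b in zip(edges, edges[1:]):
        frac = partial_moment(0, a, b, k, p, alpha)       # share of jobs sent to this host
        rho = lam * partial_moment(1, a, b, k, p, alpha)  # load on this host
        if rho >= 1.0:
            return math.inf
        # Pollaczek-Khinchine: W = lam_i * E[X_i^2] / (2 * (1 - rho_i)),
        # and lam_i * E[X_i^2] equals lam times the partial second moment.
        wait = lam * partial_moment(2, a, b, k, p, alpha) / (2.0 * (1.0 - rho))
        total += frac * wait
    return total


def best_cutoff(lam, k, p, alpha, n=200):
    """Brute-force search for a two-host cutoff on a geometric grid (illustrative only)."""
    grid = [k * (p / k) ** (i / (n + 1)) for i in range(1, n + 1)]
    return min(grid, key=lambda c: sita_mean_wait(lam, [c], k, p, alpha))


if __name__ == "__main__":
    lam, k, p, alpha = 0.5, 1.0, 1000.0, 1.5
    c = best_cutoff(lam, k, p, alpha)
    print("cutoff:", c, "mean wait:", sita_mean_wait(lam, [c], k, p, alpha))
```

With these example parameters the total offered load exceeds what one host can carry, so the single-host call `sita_mean_wait(0.5, [], 1.0, 1000.0, 1.5)` returns infinity, while a suitable two-host cutoff gives a finite value. The grid search only approximates the minimizing cutoff for a fixed, finite p and should not be read as the asymptotically optimal cutoff derived in the paper.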