A desirable behavior of an Internet server is that a request's queuing delay depends linearly on its service time. Measuring quality of service in terms of slowdown, the ratio of a request's queuing delay to its service time, provides a simple way to attain this objective. Moreover, slowdown treats client requests equally regardless of their service time, whereas response time favors requests that need more processing resources. In this paper, we propose a proportional slowdown differentiation (PSD) service model for Internet servers. It aims to maintain prespecified slowdown ratios between different classes of client requests. To provide PSD services, we first derive a closed-form expression for the expected slowdown in an M/G/1 FCFS queuing system whose service times follow a typical heavy-tailed distribution, the bounded Pareto distribution. Based on this expression, we design a queuing-theoretic strategy for processing-rate allocation. The rate allocation is realized by deploying a virtual server for each class. Simulation results show that the strategy can provide controllable PSD services on Internet servers. The strategy, however, exhibits large variance and weak predictability because of the dynamics of Internet traffic. To address these issues, we design an integral feedback controller and integrate it into the queuing-theoretic strategy. Simulation results demonstrate that the integrated strategy is robust and delivers predictable PSD services at a fine-grained level. We implemented the integrated processing-rate allocation strategy in a modified Apache Web server. Experimental results further demonstrate its effectiveness and feasibility in practice.
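The queuing-theoretic part of the abstract can be illustrated with a minimal sketch. It is not the paper's derivation or code. It relies on two standard facts: the Pollaczek-Khinchine mean waiting time for an M/G/1 queue, and the independence of a request's FCFS queuing delay from its own service time, which gives E[slowdown] = E[W] * E[1/X]. The function names, the parameter values (alpha, k, p, arrival rates, target ratios), and the normalization of total processing capacity to 1 are all assumptions made for illustration.

```python
from math import isclose


def bp_moment(j, alpha, k, p):
    """E[X^j] for a bounded Pareto(alpha, k, p) variable on [k, p]; needs j != alpha."""
    norm = alpha * k**alpha / (1.0 - (k / p) ** alpha)
    return norm * (p ** (j - alpha) - k ** (j - alpha)) / (j - alpha)


def expected_slowdown(lam, rate, alpha, k, p):
    """Expected slowdown of an M/G/1 FCFS virtual server with processing rate `rate`.

    Service times are X / rate, so E[W] * E[1/X] simplifies to
    lam * E[X^2] * E[1/X] / (2 * (rate - lam * E[X])).
    """
    ex = bp_moment(1, alpha, k, p)
    ex2 = bp_moment(2, alpha, k, p)
    einv = bp_moment(-1, alpha, k, p)
    slack = rate - lam * ex
    if slack <= 0:
        raise ValueError("virtual server is overloaded")
    return lam * ex2 * einv / (2.0 * slack)


def allocate_rates(lams, deltas, alpha, k, p):
    """Split unit capacity so that class slowdowns are proportional to `deltas`.

    Each class first gets the rate needed to carry its load (lam_i * E[X]);
    the spare capacity is then shared in proportion to lam_i / delta_i.
    """
    ex = bp_moment(1, alpha, k, p)
    rho = sum(lams) * ex
    if rho >= 1.0:
        raise ValueError("total load must be below capacity")
    weights = [lam / delta for lam, delta in zip(lams, deltas)]
    total = sum(weights)
    return [lam * ex + (w / total) * (1.0 - rho) for lam, w in zip(lams, weights)]


# Hypothetical two-class workload: equal arrival rates, target slowdown ratio 1:2.
ALPHA, K, P = 1.5, 0.1, 100.0
lams, deltas = [1.0, 1.0], [1.0, 2.0]
rates = allocate_rates(lams, deltas, ALPHA, K, P)
slowdowns = [expected_slowdown(l, c, ALPHA, K, P) for l, c in zip(lams, rates)]
ratio = slowdowns[0] / slowdowns[1]
```

Under these assumptions the computed `ratio` matches the target `deltas[0] / deltas[1]`. The sketch only covers the static, model-based allocation; it does not model the traffic dynamics that motivate the integral feedback controller described in the abstract.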