Resource containers: a new facility for resource management in server systems
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Proportional differentiated services: delay differentiation and packet scheduling
Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Characterizing the scalability of a large web-based shopping system
ACM Transactions on Internet Technology (TOIT)
Adaptive proportional delay differentiated services: characterization and performance evaluation
IEEE/ACM Transactions on Networking (TON)
Fast and flexible application-level networking on exokernel systems
ACM Transactions on Computer Systems (TOCS)
Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach
IEEE Transactions on Parallel and Distributed Systems
Proportional differentiated services: delay differentiation and packet scheduling
IEEE/ACM Transactions on Networking (TON)
Admission control and dynamic adaptation for a proportional-delay diffserv-enabled web server
SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Performance Evaluation of Service Differentiating Internet Servers
IEEE Transactions on Computers
JoBS: Joint Buffer Management and Scheduling for Differentiated Services
IWQoS '01 Proceedings of the 9th International Workshop on Quality of Service
Feedback Control with Queueing-Theoretic Prediction for Relative Delay Guarantees in Web Servers
RTAS '03 Proceedings of the The 9th IEEE Real-Time and Embedded Technology and Applications Symposium
Scalable Service Differentiation in a Shared Storage Cache
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
Modeling and Analysis of 2D Service Differentiation on e-Commerce Servers
ICDCS '04 Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS'04)
ICWS '04 Proceedings of the IEEE International Conference on Web Services
IEEE Transactions on Parallel and Distributed Systems
Integrated resource management for cluster-based internet services
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Feedback Utilization Control in Distributed Real-Time Systems with End-to-End Tasks
IEEE Transactions on Parallel and Distributed Systems
An Analytical Approach to Providing Controllable Differentiated Quality of Service in Web Servers
IEEE Transactions on Parallel and Distributed Systems
QMBF: a QoS-aware multicast routing protocol
Computer Communications
An intelligent Quality of Service brokering model for e-commerce
Expert Systems with Applications: An International Journal
A token bucket model with assured forwarding for web traffic
NBiS'07 Proceedings of the 1st international conference on Network-based information systems
Enhancing web server relative delay services by an integrated SA-fuzzy logic controller
International Journal of Web Engineering and Technology
Hi-index | 0.25 |
There is a growing demand for provisioning of different levels of quality of service (QoS) on scalable Web servers to meet changing resource availability and to satisfy different client requirements. In this paper, we investigate the problem of providing proportional QoS differentiation with respect to response time on Web servers. We first present a processing rate allocation scheme based on the foundations of queueing theory. It provides different processing rates to requests of different client classes so as to achieve the differentiation objective. At application level, process is used as the resource allocation principal for achieving processing rates on Apache Web servers. We design and implement an adaptive process allocation approach, guided by the queueing-theoretical rate allocation scheme, on an Apache server. This application-level implementation, however, shows weak QoS predictability because it does not have fine-grained control over the consumption of resources that the kernel consumes and hence the processing rate is not strictly proportional to the number of processes allocated. We then design a feedback controller and integrate it with the queueing-theoretical approach. It adjusts process allocations according to the difference between the target response time and the achieved response time using a proportional integral derivative controller. Experimental results demonstrate that this integrated approach can enable Web servers to provide robust proportional response time differentiation.