Ensuring Latency Targets in Multiclass Web Servers

Authors:
Vikram Kanodia;Edward W. Knightly
Affiliations:
-;-
Venue:
IEEE Transactions on Parallel and Distributed Systems
Year:
2003

Citing 13
Cited 15

Self-similarity in World Wide Web traffic: evidence and possible causes

IEEE/ACM Transactions on Networking (TON)
Locality-aware request distribution in cluster-based network servers

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Resource containers: a new facility for resource management in server systems

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Cluster reserves: a mechanism for resource management in cluster-based network servers

Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Connection scheduling in web servers

USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Cost-aware WWW proxy caching algorithms

USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
The eclipse operating system: providing quality of service via reservation domains

ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference
Retrofitting quality of service into a time-sharing operating system

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Scalable services via egress admission control

IEEE Transactions on Multimedia
Quality of service guarantees in virtual circuit switched networks

IEEE Journal on Selected Areas in Communications
Admission control for statistical QoS: theory and practice

IEEE Network: The Magazine of Global Internetworking
Web server support for tiered services

IEEE Network: The Magazine of Global Internetworking
A workload characterization study of the 1998 World Cup Web site

IEEE Network: The Magazine of Global Internetworking

A Proportional-Delay DiffServ-Enabled Web Server: Admission Control and Dynamic Adaptation

IEEE Transactions on Parallel and Distributed Systems
A method for transparent admission control and request scheduling in e-commerce web sites

Proceedings of the 13th international conference on World Wide Web
Workload-Aware Load Balancing for Clustered Web Servers

IEEE Transactions on Parallel and Distributed Systems
An Analytical Approach to Providing Controllable Differentiated Quality of Service in Web Servers

IEEE Transactions on Parallel and Distributed Systems
Fuzzy control for guaranteeing absolute delays in web servers

International Journal of High Performance Computing and Networking
Multiple-resource request scheduling for differentiated QoS at website gateway

Computer Communications
Energy saving strategies for cooperative cache replacement in mobile ad hoc networks

Pervasive and Mobile Computing
Cost-based admission control for Internet Commerce QoS enhancement

Electronic Commerce Research and Applications
Bandwidth requirement of links in a hierarchical caching network: a graph-based formulation, an algorithm and its performance evaluation

International Journal of Computers and Applications
Analysis and performance study for coordinated hierarchical cache placement strategies

Computer Communications
AWAIT: Efficient overload management for busy multi-tier web services under bursty workloads

ICWE'10 Proceedings of the 10th international conference on Web engineering
A self-healing web server using differentiated services

ICSOC'06 Proceedings of the 4th international conference on Service-Oriented Computing
Class-based latency assurances for web servers

HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Task distribution methods for the reconstruction of MR images

PDCAT'04 Proceedings of the 5th international conference on Parallel and Distributed Computing: applications and Technologies
Deadline and throughput-aware control for request processing systems

ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Two recent advances have resulted in significant improvements in web server quality-of-service. First, both centralized and distributed web servers can provide isolation among service classes by fairly distributing system resources. Second, session admission control can protect classes from performance degradation due to overload. The goal of this work is to design a general 驴front-end驴 algorithm that uses these two building blocks to support a new web service model, namely, multiclass services which control response latencies to within prespecified targets. Our key technique is to devise a general service abstraction to adaptively control not only the latency of a particular class, but also to bound the interclass relationships. In this way, we capture the extent to which classes are isolated or share system resources (as determined by the server architecture and system internals) and hence their effects on each other's QoS. For example, if the server provides class isolation (i.e., a minimum fraction of system resources independent of other classes), yet also allows a class to utilize unused resources from other classes, the algorithm infers and exploits this behavior, without an explicit low level model of the server. Thus, as new functionalities are incorporated into web servers, the approach naturally exploits their properties to efficiently satisfy the classes' performance targets. We validate the scheme with trace driven simulations.