Self-similarity in World Wide Web traffic: evidence and possible causes
IEEE/ACM Transactions on Networking (TON)
Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Resource containers: a new facility for resource management in server systems
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Cluster reserves: a mechanism for resource management in cluster-based network servers
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Connection scheduling in web servers
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Cost-aware WWW proxy caching algorithms
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
The eclipse operating system: providing quality of service via reservation domains
ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference
Retrofitting quality of service into a time-sharing operating system
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Scalable services via egress admission control
IEEE Transactions on Multimedia
Quality of service guarantees in virtual circuit switched networks
IEEE Journal on Selected Areas in Communications
Admission control for statistical QoS: theory and practice
IEEE Network: The Magazine of Global Internetworking
Web server support for tiered services
IEEE Network: The Magazine of Global Internetworking
A workload characterization study of the 1998 World Cup Web site
IEEE Network: The Magazine of Global Internetworking
A Proportional-Delay DiffServ-Enabled Web Server: Admission Control and Dynamic Adaptation
IEEE Transactions on Parallel and Distributed Systems
A method for transparent admission control and request scheduling in e-commerce web sites
Proceedings of the 13th international conference on World Wide Web
Workload-Aware Load Balancing for Clustered Web Servers
IEEE Transactions on Parallel and Distributed Systems
An Analytical Approach to Providing Controllable Differentiated Quality of Service in Web Servers
IEEE Transactions on Parallel and Distributed Systems
Fuzzy control for guaranteeing absolute delays in web servers
International Journal of High Performance Computing and Networking
Multiple-resource request scheduling for differentiated QoS at website gateway
Computer Communications
Energy saving strategies for cooperative cache replacement in mobile ad hoc networks
Pervasive and Mobile Computing
Cost-based admission control for Internet Commerce QoS enhancement
Electronic Commerce Research and Applications
International Journal of Computers and Applications
Analysis and performance study for coordinated hierarchical cache placement strategies
Computer Communications
AWAIT: Efficient overload management for busy multi-tier web services under bursty workloads
ICWE'10 Proceedings of the 10th international conference on Web engineering
A self-healing web server using differentiated services
ICSOC'06 Proceedings of the 4th international conference on Service-Oriented Computing
Class-based latency assurances for web servers
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Task distribution methods for the reconstruction of MR images
PDCAT'04 Proceedings of the 5th international conference on Parallel and Distributed Computing: applications and Technologies
Deadline and throughput-aware control for request processing systems
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
Hi-index | 0.00 |
Two recent advances have resulted in significant improvements in web server quality-of-service. First, both centralized and distributed web servers can provide isolation among service classes by fairly distributing system resources. Second, session admission control can protect classes from performance degradation due to overload. The goal of this work is to design a general 驴front-end驴 algorithm that uses these two building blocks to support a new web service model, namely, multiclass services which control response latencies to within prespecified targets. Our key technique is to devise a general service abstraction to adaptively control not only the latency of a particular class, but also to bound the interclass relationships. In this way, we capture the extent to which classes are isolated or share system resources (as determined by the server architecture and system internals) and hence their effects on each other's QoS. For example, if the server provides class isolation (i.e., a minimum fraction of system resources independent of other classes), yet also allows a class to utilize unused resources from other classes, the algorithm infers and exploits this behavior, without an explicit low level model of the server. Thus, as new functionalities are incorporated into web servers, the approach naturally exploits their properties to efficiently satisfy the classes' performance targets. We validate the scheme with trace driven simulations.