Handling Multiple Bottlenecks in Web Servers Using Adaptive Inbound Controls

Authors:
Thiemo Voigt; Per Gunningberg
Venue:
PIHSN '02 Proceedings of the 7th IFIP/IEEE International Workshop on Protocols for High Speed Networks
Year:
2002

Abstract

Web servers become overloaded when one or more server resources are overutilized. In this paper we present an adaptive architecture that prevents resource overutilization in web servers by performing admission control based on application-level information found in HTTP headers and on knowledge of the resource consumption of requests. In addition, we use an efficient early-discard mechanism that consumes only a small amount of resources when rejecting requests. This mechanism comes into play only when the request rate is very high, so as to avoid uninformed request rejections that might abort ongoing sessions. We present our dual admission control architecture and experiments showing that it sustains high throughput and low response times even under high load.
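
To make the dual structure concrete, the following is a minimal sketch of the idea described in the abstract, not the authors' implementation. It assumes token buckets as the rate-limiting mechanism, a hypothetical `classify` rule that maps request paths to a bottleneck resource, and a simple multiplicative `adapt` step driven by a measured utilization. All names, path patterns, and constants are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass
class TokenBucket:
    """Admits up to `rate` requests per second with bursts of up to `burst`."""
    rate: float
    burst: float
    tokens: float = 0.0
    last: float = 0.0

    def __post_init__(self):
        self.tokens = self.burst  # start with a full bucket

    def allow(self, now: float) -> bool:
        # Refill according to elapsed time, capped at the burst size.
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


def classify(path: str) -> str:
    """Hypothetical rule: guess which resource a request stresses most."""
    if path.startswith("/cgi-bin/"):
        return "cpu"        # assumed: dynamic content is CPU-intensive
    if path.endswith((".zip", ".mpg")):
        return "bandwidth"  # assumed: large files are bandwidth-intensive
    return "other"


class DualAdmissionControl:
    def __init__(self, early: TokenBucket, buckets: dict):
        self.early = early      # cheap first stage with a generous limit
        self.buckets = buckets  # one bucket per potential bottleneck resource

    def admit(self, now: float, path: str) -> str:
        # Stage 1: early discard, decided without looking at the request.
        if not self.early.allow(now):
            return "discarded"
        # Stage 2: informed decision based on application-level information.
        bucket = self.buckets.get(classify(path))
        if bucket is not None and not bucket.allow(now):
            return "rejected"
        return "admitted"

    def adapt(self, resource: str, utilization: float, target: float = 0.9):
        # Assumed adaptation rule: shrink the rate when the resource is
        # above the target utilization, grow it otherwise.
        bucket = self.buckets[resource]
        factor = 0.9 if utilization > target else 1.1
        bucket.rate = max(1.0, bucket.rate * factor)
```

In this sketch the early-discard bucket is given a much higher limit than the per-resource buckets, so it only triggers under very high request rates. Below that point, every rejection is made by the second stage with knowledge of what the request would consume.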