Connection scheduling in web servers

Authors:
Mark E. Crovella;Robert Frangioso;Mor Harchol-Balter
Affiliations:
Department of Computer Science, Boston University, Boston, MA;Department of Computer Science, Boston University, Boston, MA;School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
Venue:
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Year:
1999

Citing 11
Cited 68

Lazy receiver processing (LRP): a network subsystem architecture for server systems

OSDI '96 Proceedings of the second USENIX symposium on Operating systems design and implementation
Application performance and flexibility on exokernel systems

Proceedings of the sixteenth ACM symposium on Operating systems principles
Internet Web servers: workload characterization and performance implications

IEEE/ACM Transactions on Networking (TON)
Self-similarity in World Wide Web traffic: evidence and possible causes

IEEE/ACM Transactions on Networking (TON)
Generating representative Web workloads for network and server performance evaluation

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Resource containers: a new facility for resource management in server systems

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Flow and stretch metrics for scheduling continuous job streams

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Changes in Web client access patterns: Characteristics and caching implications

World Wide Web
Connection Scheduling in Web Servers

Connection Scheduling in Web Servers
Web facts and fantasy

USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Flash: an efficient and portable web server

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference

Downlink scheduling in CDMA data networks

MobiCom '00 Proceedings of the 6th annual international conference on Mobile computing and networking
SEDA: an architecture for well-conditioned, scalable internet services

SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
The state of the art in locally distributed Web-server systems

ACM Computing Surveys (CSUR)
Content management on server farm with layer-7 routing

Proceedings of the 2002 ACM symposium on Applied computing
Admission control and dynamic adaptation for a proportional-delay diffserv-enabled web server

SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Flash crowds and denial of service attacks: characterization and implications for CDNs and web sites

Proceedings of the 11th international conference on World Wide Web
Improved algorithms for stretch scheduling

SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
An optimal service ordering for a world wide web server

ACM SIGMETRICS Performance Evaluation Review
Evaluation of packet scheduling algorithms in mobile ad hoc networks

ACM SIGMOBILE Mobile Computing and Communications Review
Improving the performance of interactive TCP applications using service differentiation

Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: Towards a new internet architecture
Ensuring Latency Targets in Multiclass Web Servers

IEEE Transactions on Parallel and Distributed Systems
Modeling and performance analysis of QoS-aware load balancing of web-server clusters

Computer Networks: The International Journal of Computer and Telecommunications Networking
Static Document Scheduling with Improved Response Time in HTTP/1.1

ICOIN '02 Revised Papers from the International Conference on Information Networking, Wireless Communications Technologies and Network Applications-Part I
SRPT Scheduling for Web Servers

JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Performance Evaluation with Heavy Tailed Distributions

JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Non-clairvoyant Scheduling for Minimizing Mean Slowdown

STACS '03 Proceedings of the 20th Annual Symposium on Theoretical Aspects of Computer Science
New Algorithmic Challenges Arising in Measurement-Driven Networking Research

ALENEX '01 Revised Papers from the Third International Workshop on Algorithm Engineering and Experimentation
Exploiting Gray-Box Knowledge of Buffer-Cache Management

ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
Performance Evaluation with Heavy Tailed Distributions

TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Size-based scheduling to improve web performance

ACM Transactions on Computer Systems (TOCS)
Preferential treatment for short flows to reduce web latency

Computer Networks: The International Journal of Computer and Telecommunications Networking
Server scheduling in the Lp norm: a rising tide lifts all boat

Proceedings of the thirty-fifth annual ACM symposium on Theory of computing
Analysis of LAS scheduling for job size distributions with high variance

SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Flexibility, Manageability, and Performance in a Grid Storage Appliance

HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
The effectiveness of request redirection on CDN robustness

ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Revisiting correlations and scheduling in Web servers

ACM SIGMETRICS Performance Evaluation Review - Special issue on the fifth workshop on MAthematical performance Modeling and Analysis (MAMA 2003)
Average stretch without migration

Journal of Computer and System Sciences
References

Grid resource management
A Proportional-Delay DiffServ-Enabled Web Server: Admission Control and Dynamic Adaptation

IEEE Transactions on Parallel and Distributed Systems
A method for transparent admission control and request scheduling in e-commerce web sites

Proceedings of the 13th international conference on World Wide Web
The origins of network server latency & the myth of connection scheduling

Proceedings of the joint international conference on Measurement and modeling of computer systems
Multi-processor scheduling to minimize flow time with ε resource augmentation

STOC '04 Proceedings of the thirty-sixth annual ACM symposium on Theory of computing
A survey on statistical bandwidth sharing

Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: In memroy of Olga Casals
Semi-clairvoyant scheduling

Theoretical Computer Science - Special issue: Online algorithms in memoriam, Steve Seiden
Traffic aided opportunistic scheduling for wireless networks: algorithms and performance bounds

Computer Networks: The International Journal of Computer and Telecommunications Networking
The effectiveness of request redirection on CDN robustness

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Looking at the server side of peer-to-peer systems

LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Robust Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers

IEEE Transactions on Computers
Web servers under overload: How scheduling can help

ACM Transactions on Internet Technology (TOIT)
Selective early request termination for busy internet services

Proceedings of the 15th international conference on World Wide Web
Understanding the management of client perceived response time

SIGMETRICS '06/Performance '06 Proceedings of the joint international conference on Measurement and modeling of computer systems
Design and evaluation of web proxies by leveraging self-similarity of web traffic

Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: Network modelling and simulation
Revisiting unfairness in web server scheduling

Computer Networks: The International Journal of Computer and Telecommunications Networking
System support for web hosting services on server clusters

Computers and Electrical Engineering
Scheduling network traffic

ACM SIGMETRICS Performance Evaluation Review
Designing an overload control strategy for secure e-commerce applications

Computer Networks: The International Journal of Computer and Telecommunications Networking
Fuzzy control for guaranteeing absolute delays in web servers

International Journal of High Performance Computing and Networking
Dynamic CPU provisioning for self-managed secure web applications in SMP hosting platforms

Computer Networks: The International Journal of Computer and Telecommunications Networking
Multiple-resource request scheduling for differentiated QoS at website gateway

Computer Communications
Application controlled caching for web servers

Enterprise Information Systems
I/O resource management through system call scheduling

ACM SIGOPS Operating Systems Review - Research and developments in the Linux kernel
Improving peer-to-peer performance through server-side scheduling

ACM Transactions on Computer Systems (TOCS)
Measuring capacity bandwidth of targeted path segments

IEEE/ACM Transactions on Networking (TON)
Dynamic thread assignment in web server performance optimization

Performance Evaluation
Fair, effective, efficient and differentiated scheduling in an enterprise data warehouse

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Optimal scheduling of jobs with a DHR tail in the M/G/1 queue

Proceedings of the 3rd International Conference on Performance Evaluation Methodologies and Tools
Cost-based admission control for Internet Commerce QoS enhancement

Electronic Commerce Research and Applications
Anomaly detection scheme using data mining in mobile environment

ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: PartII
AWAIT: Efficient overload management for busy multi-tier web services under bursty workloads

ICWE'10 Proceedings of the 10th international conference on Web engineering
Server Scheduling to Balance Priorities, Fairness, and Average Quality of Service

SIAM Journal on Computing
Performance Evaluation of Overload Control in Multi-cluster Grids

GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Class-based latency assurances for web servers

HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
A weighted-fair-queuing (WFQ)-based dynamic request scheduling approach in a multi-core system

Future Generation Computer Systems
PACMan: coordinated memory caching for parallel jobs

NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Analysis and control of correlated web server queues

Computer Communications
Application-specific thread schedulers for internet server applications

Concurrency and Computation: Practice & Experience
Separation of timescales in a two-layered network

Proceedings of the 24th International Teletraffic Congress
Decoupled speed scaling: Analysis and evaluation

Performance Evaluation

Quantified Score

Hi-index	0.02

Visualization

Abstract

Under high loads, a Web server may be servicing many hundreds of connections concurrently. In traditional Web servers, the question of the order in which concurrent connections are serviced has been left to the operating system. In this paper we ask whether servers might provide better service by using non-traditional service ordering. In particular, for the case when a Web server is serving static files, we examine the costs and benefits of a policy that gives preferential service to short connections. We start by assessing the scheduling behavior of a commonly used server (Apache running on Linux) with respect to connection size and show that it does not appear to provide preferential service to short connections. We then examine the potential performance improvements of a policy that does favor short connections (shortest-connection-first). We show that mean response time can be improved by factors of four or five under shortest-connection-first, as compared to an (Apache-like) size-independent policy. Finally we assess the costs of shortest-connection-first scheduling in terms of unfairness (i.e., the degree to which long connections suffer). We show that under shortest-connection-first scheduling, long connections pay very little penalty. This surprising result can be understood as a consequence of heavy-tailed Web server workloads, in which most connections are small, but most server load is due to the few large connections. We support this explanation using analysis.