Lazy receiver processing (LRP): a network subsystem architecture for server systems
OSDI '96 Proceedings of the second USENIX symposium on Operating systems design and implementation
Application performance and flexibility on exokernel systems
Proceedings of the sixteenth ACM symposium on Operating systems principles
Internet Web servers: workload characterization and performance implications
IEEE/ACM Transactions on Networking (TON)
Self-similarity in World Wide Web traffic: evidence and possible causes
IEEE/ACM Transactions on Networking (TON)
Generating representative Web workloads for network and server performance evaluation
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Resource containers: a new facility for resource management in server systems
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Flow and stretch metrics for scheduling continuous job streams
Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Connection Scheduling in Web Servers
Connection Scheduling in Web Servers
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Flash: an efficient and portable web server
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Downlink scheduling in CDMA data networks
MobiCom '00 Proceedings of the 6th annual international conference on Mobile computing and networking
SEDA: an architecture for well-conditioned, scalable internet services
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
The state of the art in locally distributed Web-server systems
ACM Computing Surveys (CSUR)
Content management on server farm with layer-7 routing
Proceedings of the 2002 ACM symposium on Applied computing
Admission control and dynamic adaptation for a proportional-delay diffserv-enabled web server
SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Flash crowds and denial of service attacks: characterization and implications for CDNs and web sites
Proceedings of the 11th international conference on World Wide Web
Improved algorithms for stretch scheduling
SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
An optimal service ordering for a world wide web server
ACM SIGMETRICS Performance Evaluation Review
Evaluation of packet scheduling algorithms in mobile ad hoc networks
ACM SIGMOBILE Mobile Computing and Communications Review
Improving the performance of interactive TCP applications using service differentiation
Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: Towards a new internet architecture
Ensuring Latency Targets in Multiclass Web Servers
IEEE Transactions on Parallel and Distributed Systems
Modeling and performance analysis of QoS-aware load balancing of web-server clusters
Computer Networks: The International Journal of Computer and Telecommunications Networking
Static Document Scheduling with Improved Response Time in HTTP/1.1
ICOIN '02 Revised Papers from the International Conference on Information Networking, Wireless Communications Technologies and Network Applications-Part I
SRPT Scheduling for Web Servers
JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Performance Evaluation with Heavy Tailed Distributions
JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
Non-clairvoyant Scheduling for Minimizing Mean Slowdown
STACS '03 Proceedings of the 20th Annual Symposium on Theoretical Aspects of Computer Science
New Algorithmic Challenges Arising in Measurement-Driven Networking Research
ALENEX '01 Revised Papers from the Third International Workshop on Algorithm Engineering and Experimentation
Exploiting Gray-Box Knowledge of Buffer-Cache Management
ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
Performance Evaluation with Heavy Tailed Distributions
TOOLS '00 Proceedings of the 11th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
Size-based scheduling to improve web performance
ACM Transactions on Computer Systems (TOCS)
Preferential treatment for short flows to reduce web latency
Computer Networks: The International Journal of Computer and Telecommunications Networking
Server scheduling in the Lp norm: a rising tide lifts all boat
Proceedings of the thirty-fifth annual ACM symposium on Theory of computing
Analysis of LAS scheduling for job size distributions with high variance
SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Flexibility, Manageability, and Performance in a Grid Storage Appliance
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
The effectiveness of request redirection on CDN robustness
ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Revisiting correlations and scheduling in Web servers
ACM SIGMETRICS Performance Evaluation Review - Special issue on the fifth workshop on MAthematical performance Modeling and Analysis (MAMA 2003)
Average stretch without migration
Journal of Computer and System Sciences
Grid resource management
A Proportional-Delay DiffServ-Enabled Web Server: Admission Control and Dynamic Adaptation
IEEE Transactions on Parallel and Distributed Systems
A method for transparent admission control and request scheduling in e-commerce web sites
Proceedings of the 13th international conference on World Wide Web
The origins of network server latency & the myth of connection scheduling
Proceedings of the joint international conference on Measurement and modeling of computer systems
Multi-processor scheduling to minimize flow time with ε resource augmentation
STOC '04 Proceedings of the thirty-sixth annual ACM symposium on Theory of computing
A survey on statistical bandwidth sharing
Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: In memroy of Olga Casals
Theoretical Computer Science - Special issue: Online algorithms in memoriam, Steve Seiden
Traffic aided opportunistic scheduling for wireless networks: algorithms and performance bounds
Computer Networks: The International Journal of Computer and Telecommunications Networking
The effectiveness of request redirection on CDN robustness
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Looking at the server side of peer-to-peer systems
LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Robust Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers
IEEE Transactions on Computers
Web servers under overload: How scheduling can help
ACM Transactions on Internet Technology (TOIT)
Selective early request termination for busy internet services
Proceedings of the 15th international conference on World Wide Web
Understanding the management of client perceived response time
SIGMETRICS '06/Performance '06 Proceedings of the joint international conference on Measurement and modeling of computer systems
Design and evaluation of web proxies by leveraging self-similarity of web traffic
Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: Network modelling and simulation
Revisiting unfairness in web server scheduling
Computer Networks: The International Journal of Computer and Telecommunications Networking
System support for web hosting services on server clusters
Computers and Electrical Engineering
ACM SIGMETRICS Performance Evaluation Review
Designing an overload control strategy for secure e-commerce applications
Computer Networks: The International Journal of Computer and Telecommunications Networking
Fuzzy control for guaranteeing absolute delays in web servers
International Journal of High Performance Computing and Networking
Dynamic CPU provisioning for self-managed secure web applications in SMP hosting platforms
Computer Networks: The International Journal of Computer and Telecommunications Networking
Multiple-resource request scheduling for differentiated QoS at website gateway
Computer Communications
Application controlled caching for web servers
Enterprise Information Systems
I/O resource management through system call scheduling
ACM SIGOPS Operating Systems Review - Research and developments in the Linux kernel
Improving peer-to-peer performance through server-side scheduling
ACM Transactions on Computer Systems (TOCS)
Measuring capacity bandwidth of targeted path segments
IEEE/ACM Transactions on Networking (TON)
Dynamic thread assignment in web server performance optimization
Performance Evaluation
Fair, effective, efficient and differentiated scheduling in an enterprise data warehouse
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Optimal scheduling of jobs with a DHR tail in the M/G/1 queue
Proceedings of the 3rd International Conference on Performance Evaluation Methodologies and Tools
Cost-based admission control for Internet Commerce QoS enhancement
Electronic Commerce Research and Applications
Anomaly detection scheme using data mining in mobile environment
ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: PartII
AWAIT: Efficient overload management for busy multi-tier web services under bursty workloads
ICWE'10 Proceedings of the 10th international conference on Web engineering
Server Scheduling to Balance Priorities, Fairness, and Average Quality of Service
SIAM Journal on Computing
Performance Evaluation of Overload Control in Multi-cluster Grids
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Class-based latency assurances for web servers
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
A weighted-fair-queuing (WFQ)-based dynamic request scheduling approach in a multi-core system
Future Generation Computer Systems
PACMan: coordinated memory caching for parallel jobs
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Analysis and control of correlated web server queues
Computer Communications
Application-specific thread schedulers for internet server applications
Concurrency and Computation: Practice & Experience
Separation of timescales in a two-layered network
Proceedings of the 24th International Teletraffic Congress
Decoupled speed scaling: Analysis and evaluation
Performance Evaluation
Hi-index | 0.02 |
Under high loads, a Web server may be servicing many hundreds of connections concurrently. In traditional Web servers, the question of the order in which concurrent connections are serviced has been left to the operating system. In this paper we ask whether servers might provide better service by using non-traditional service ordering. In particular, for the case when a Web server is serving static files, we examine the costs and benefits of a policy that gives preferential service to short connections. We start by assessing the scheduling behavior of a commonly used server (Apache running on Linux) with respect to connection size and show that it does not appear to provide preferential service to short connections. We then examine the potential performance improvements of a policy that does favor short connections (shortest-connection-first). We show that mean response time can be improved by factors of four or five under shortest-connection-first, as compared to an (Apache-like) size-independent policy. Finally we assess the costs of shortest-connection-first scheduling in terms of unfairness (i.e., the degree to which long connections suffer). We show that under shortest-connection-first scheduling, long connections pay very little penalty. This surprising result can be understood as a consequence of heavy-tailed Web server workloads, in which most connections are small, but most server load is due to the few large connections. We support this explanation using analysis.