Performance problems in BSD4. 4TCP
ACM SIGCOMM Computer Communication Review
The case for persistent-connection HTTP
SIGCOMM '95 Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Lazy receiver processing (LRP): a network subsystem architecture for server systems
OSDI '96 Proceedings of the second USENIX symposium on Operating systems design and implementation
Dummynet: a simple approach to the evaluation of network protocols
ACM SIGCOMM Computer Communication Review
Multiprocessor scheduling with client resources to improve the response time of WWW applications
ICS '97 Proceedings of the 11th international conference on Supercomputing
Generating representative Web workloads for network and server performance evaluation
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Analysis of Task Assignment Policies in Scalable Distributed Web-Server Systems
IEEE Transactions on Parallel and Distributed Systems
Resource containers: a new facility for resource management in server systems
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
A performance evaluation of hyper text transfer protocols
SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Web content adaptation to improve server overload behavior
WWW '99 Proceedings of the eighth international conference on World Wide Web
Flow and stretch metrics for scheduling continuous job streams
Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Modeling TCP Reno performance: a simple model and its empirical validation
IEEE/ACM Transactions on Networking (TON)
The effects of wide-area conditions on WWW server performance
Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Analysis of SRPT scheduling: investigating unfairness
Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Web protocols and practice: HTTP/1.1, Networking protocols, caching, and traffic measurement
Web protocols and practice: HTTP/1.1, Networking protocols, caching, and traffic measurement
EW 7 Proceedings of the 7th workshop on ACM SIGOPS European workshop: Systems support for worldwide applications
On the use and performance of content distribution networks
IMW '01 Proceedings of the 1st ACM SIGCOMM Workshop on Internet Measurement
Computer Networking: A Top-Down Approach Featuring the Internet
Computer Networking: A Top-Down Approach Featuring the Internet
Operating System Concepts
Kernel-based control of persistent web server connections
ACM SIGMETRICS Performance Evaluation Review
Measuring the capacity of a Web server under realistic loads
World Wide Web
Kernel Mechanisms for Service Differentiation in Overloaded Web Servers
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Size-based scheduling to improve web performance
ACM Transactions on Computer Systems (TOCS)
Analysis of LAS scheduling for job size distributions with high variance
SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Fairness and efficiency in web server protocols
SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Classifying scheduling policies with respect to unfairness in an M/GI/1
SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
A scalable and highly available web server
COMPCON '96 Proceedings of the 41st IEEE International Computer Conference
SWIFT: Scheduling in Web Servers for Fast Response Time
NCA '03 Proceedings of the Second IEEE International Symposium on Network Computing and Applications
Application-level document caching in the Internet
SDNE '95 Proceedings of the 2nd International Workshop on Services in Distributed and Networked Environments
Priority Mechanisms for OLTP and Transactional Web Applications
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
A method for transparent admission control and request scheduling in e-commerce web sites
Proceedings of the 13th international conference on World Wide Web
Adaptive overload control for busy internet servers
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Connection scheduling in web servers
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Eliminating receive livelock in an interrupt-driven kernel
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
Flash: an efficient and portable web server
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
A workload characterization study of the 1998 World Cup Web site
IEEE Network: The Magazine of Global Internetworking
Selective early request termination for busy internet services
Proceedings of the 15th international conference on World Wide Web
Revisiting unfairness in web server scheduling
Computer Networks: The International Journal of Computer and Telecommunications Networking
Implementing class of service in web server using adaptive scheduling and admission control
WebMedia '06 Proceedings of the 12th Brazilian Symposium on Multimedia and the web
ACM SIGMETRICS Performance Evaluation Review
Predicting short-transfer latency from TCP arcana: a trace-based validation
IMC '05 Proceedings of the 5th ACM SIGCOMM conference on Internet Measurement
Improving peer-to-peer performance through server-side scheduling
ACM Transactions on Computer Systems (TOCS)
Modeling and exploiting query interactions in database systems
Proceedings of the 17th ACM conference on Information and knowledge management
Optimizing i/o-intensive transactions in highly interactive applications
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Caching and Materialization for Web Databases
Foundations and Trends in Databases
Characterizing, modeling, and generating workload spikes for stateful services
Proceedings of the 1st ACM symposium on Cloud computing
Design and implementation of a generic resource sharing virtual time dispatcher
Proceedings of the 3rd Annual Haifa Experimental Systems Conference
HELP: // hypertext in-emergency leveraging protocol
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
The Journal of Supercomputing
A predictive and probabilistic load-balancing algorithm for cluster-based web servers
Applied Soft Computing
Server Scheduling to Balance Priorities, Fairness, and Average Quality of Service
SIAM Journal on Computing
Towards more effective utilization of computer systems
Proceedings of the 2nd ACM/SPEC International Conference on Performance engineering
EFD: an efficient low-overhead scheduler
NETWORKING'11 Proceedings of the 10th international IFIP TC 6 conference on Networking - Volume Part II
Interaction-aware scheduling of report-generation workloads
The VLDB Journal — The International Journal on Very Large Data Bases
A research study on business-oriented quality-driven request service in a B2C web site
ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II
Performance Evaluation of Overload Control in Multi-cluster Grids
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
When Ambient Intelligence meets the Internet: User Module framework and its applications
Computer Networks: The International Journal of Computer and Telecommunications Networking
Adaptive admission control algorithm in a QoS-aware Web system
Information Sciences: an International Journal
A weighted-fair-queuing (WFQ)-based dynamic request scheduling approach in a multi-core system
Future Generation Computer Systems
Memory performance prediction of web server applications based on grey system theory
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Budget-based control for interactive services with adaptive execution
Proceedings of the 9th international conference on Autonomic computing
Resource utilization prediction: a proposal for information technology research
Proceedings of the 1st Annual conference on Research in information technology
WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
QACO: exploiting partial execution in web servers
Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
Hi-index | 0.00 |
This article provides a detailed implementation study on the behavior of web serves that serve static requests where the load fluctuates over time (transient overload). Various external factors are considered, including WAN delays and losses and different client behavior models. We find that performance can be dramatically improved via a kernel-level modification to the web server to change the scheduling policy at the server from the standard FAIR (processor-sharing) scheduling to SRPT (shortest-remaining-processing-time) scheduling. We find that SRPT scheduling induces no penalties. In particular, throughput is not sacrificed and requests for long files experience only negligibly higher response times under SRPT than they did under the original FAIR scheduling.