This paper presents a systematic study of the properties of a large number of Web sites hosted by a major ISP. To our knowledge, ours is the first comprehensive study of a large server farm that contains thousands of commercial Web sites. We also perform a simulation analysis to estimate the potential performance benefits of content delivery networks (CDNs) for these Web sites. We make several observations about the current usage of Web technologies and about Web site performance characteristics. First, compared with previous client workload studies, the Web server farm workload contains a much higher proportion of uncacheable responses and of responses that require mandatory cache validation. A significant reason for this is that cookie use is prevalent among our population, especially among the more popular sites. However, we found indications of widespread indiscriminate use of cookies, which unnecessarily impedes many content delivery optimizations. We also found that most Web sites do not use the cache-control features of the HTTP/1.1 protocol, resulting in suboptimal performance. Moreover, the implicit expiration time in client caches for responses is constrained by the maximum values allowed in the Squid proxy. Finally, our simulation results indicate that most Web sites benefit from the use of a CDN. The amount of the benefit depends on site popularity, and, somewhat surprisingly, a CDN may increase the peak-to-average request ratio at the origin server because the CDN can decrease the average request rate more than the peak request rate.
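The last observation can be checked with simple arithmetic. The following is a minimal sketch, not the simulation used in the study: the request rates and CDN hit ratios are hypothetical values chosen only for illustration, and the assumption that the hit ratio is lower during the burst is ours, not a mechanism stated in the paper. The helper names `peak_to_average` and `origin_rates` are likewise illustrative.

```python
def peak_to_average(rates):
    """Return the ratio of the peak request rate to the mean request rate."""
    return max(rates) / (sum(rates) / len(rates))


def origin_rates(rates, hit_ratios):
    """Return the per-interval request rate that still reaches the origin
    after requests served by the CDN are removed."""
    return [r * (1.0 - h) for r, h in zip(rates, hit_ratios)]


# Hypothetical client request rates (requests/s) over six intervals;
# the fourth interval is a burst.
client_rates = [100, 100, 100, 400, 100, 100]

# Hypothetical CDN hit ratios per interval. Assumption: the CDN absorbs
# a smaller fraction of requests during the burst than in quiet intervals.
hit_ratios = [0.8, 0.8, 0.8, 0.5, 0.8, 0.8]

without_cdn = peak_to_average(client_rates)
with_cdn = peak_to_average(origin_rates(client_rates, hit_ratios))

print(f"peak-to-average without CDN: {without_cdn:.2f}")
print(f"peak-to-average with CDN:    {with_cdn:.2f}")
```

With these made-up numbers the origin's mean rate falls from 150 to about 50 requests/s (a factor of three) while its peak falls only from 400 to 200 (a factor of two), so the peak-to-average ratio rises from roughly 2.67 to roughly 4.0 even though the origin handles fewer requests overall.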