The growing popularity of the World Wide Web is placing tremendous demands on the Internet. A key strategy for scaling the Internet to meet these increasing demands is to cache data near clients and thus improve access latency and reduce network and server load. Unfortunately, research in this area has been hampered by a poor understanding of the locality and sharing characteristics of Web-client accesses. The recent popularity of Web proxy servers provides a unique opportunity to improve this understanding, because a small number of proxy servers see accesses from thousands of clients. This paper presents an analysis of access traces collected from seven proxy servers deployed in various locations throughout the Internet. The traces record a total of 47.4 million requests made by 23,700 clients over a twenty-one-day period. We use a combination of static analysis and trace-driven cache simulation to characterize the locality and sharing properties of these accesses. Our analysis shows that a 2- to 10-GB second-level cache yields hit rates between 24% and 45%, with 85% of these hits due to sharing among different clients. Caches with more clients exhibit more sharing and thus higher hit rates. Between 2% and 7% of accesses are consistency misses to unmodified objects, using the Squid and CERN proxy cache coherence protocols. Sharing is bimodal: requests for shared objects are divided evenly between objects that are narrowly shared and those that are shared by many clients; widely shared objects also tend to be shared by clients from unrelated traces.
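To make the idea of trace-driven cache simulation concrete, the following is a minimal sketch and not the authors' simulator. It assumes an LRU replacement policy, a trace of hypothetical `(client, url, size)` tuples, and a definition of a "sharing hit" as a hit on an object the requesting client has not itself requested before. None of these details are stated in the abstract, and consistency (expiry) misses are ignored.

```python
from collections import OrderedDict


def simulate(trace, capacity_bytes):
    """Replay (client, url, size) requests through an LRU proxy cache.

    Returns (hits, shared_hits, total_requests). LRU and the definition
    of a shared hit are illustrative assumptions, not the paper's method.
    """
    cache = OrderedDict()  # url -> size, ordered least to most recently used
    used = 0               # bytes currently held in the cache
    seen_by = {}           # url -> set of clients that have requested it
    hits = shared_hits = 0

    for client, url, size in trace:
        clients = seen_by.setdefault(url, set())
        if url in cache:
            hits += 1
            cache.move_to_end(url)  # mark as most recently used
            if client not in clients:
                # The object is cached only because another client fetched it.
                shared_hits += 1
        elif size <= capacity_bytes:
            # Miss: evict least recently used objects until the new one fits.
            while used + size > capacity_bytes:
                _, evicted_size = cache.popitem(last=False)
                used -= evicted_size
            cache[url] = size
            used += size
        clients.add(client)

    return hits, shared_hits, len(trace)


# Toy trace (made-up data): two clients, three objects of 10 bytes each.
trace = [
    ("a", "/x", 10),
    ("b", "/x", 10),
    ("a", "/x", 10),
    ("a", "/y", 10),
    ("b", "/z", 10),
    ("b", "/x", 10),
]

hits, shared_hits, total = simulate(trace, capacity_bytes=20)
print(f"hit rate {hits / total:.2f}, shared hits {shared_hits} of {hits}")
```

Rerunning the same trace with a larger `capacity_bytes` shows how hit rate varies with cache size, which is the kind of sweep the abstract's 2- to 10-GB figures summarize.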