Caching in the World Wide Web is based on two critical assumptions: that a significant fraction of requests reaccess resources that have already been retrieved; and that those resources do not change between accesses. We tested the validity of these assumptions, and their dependence on characteristics of Web resources, including access rate, age at time of reference, content type, resource size, and Internet top-level domain. We also measured the rate at which resources change, and the prevalence of duplicate copies in the Web. We quantified the potential benefit of a shared proxy-caching server in a large environment by using traces that were collected at the Internet connection points for two large corporations, representing significant numbers of references. Only 22% of the resources referenced in the traces we analyzed were accessed more than once, but about half of the references were to those multiply-referenced resources. Of this half, 13% were to a resource that had been modified since the previous traced reference to it. We found that the content type and rate of access have a strong influence on these metrics, the domain has a moderate influence, and size has little effect. In addition, we studied other aspects of the rate of change, including semantic differences such as the insertion or deletion of anchors, phone numbers, and email addresses.
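The three headline percentages above (resources accessed more than once, references to those resources, and references that found the resource modified) can be illustrated with a minimal sketch. It is not the authors' analysis code. The trace format, a time-ordered sequence of `(url, checksum)` pairs, the function name `reuse_and_change_metrics`, and the choice of denominators are all assumptions made for illustration. In particular, change is detected here by comparing a content checksum with the one seen at the previous reference to the same URL, which is only one plausible way to decide that a resource "had been modified".

```python
from collections import Counter


def reuse_and_change_metrics(trace):
    """Compute reuse and change fractions from a hypothetical trace.

    trace: iterable of (url, checksum) pairs in time order (assumed format).
    Returns a dict of three fractions; each is 0.0 if its denominator is 0.
    """
    counts = Counter()   # number of references per URL
    last_checksum = {}   # checksum seen at the previous reference to each URL
    changed_refs = 0     # references whose checksum differs from the previous one
    total_refs = 0

    for url, checksum in trace:
        total_refs += 1
        counts[url] += 1
        if url in last_checksum and last_checksum[url] != checksum:
            changed_refs += 1
        last_checksum[url] = checksum

    multi_urls = [u for u, n in counts.items() if n > 1]
    refs_to_multi = sum(counts[u] for u in multi_urls)

    def ratio(num, den):
        return num / den if den else 0.0

    return {
        # fraction of distinct resources referenced more than once
        "resources_multi": ratio(len(multi_urls), len(counts)),
        # fraction of all references that go to such resources
        "refs_to_multi": ratio(refs_to_multi, total_refs),
        # of those references, the fraction that saw changed content
        "changed_given_multi": ratio(changed_refs, refs_to_multi),
    }


# Tiny made-up trace: "a" is referenced three times and changes once.
sample = [("a", 1), ("b", 1), ("a", 1), ("a", 2), ("c", 1), ("d", 1)]
metrics = reuse_and_change_metrics(sample)
```

In this sketch, `resources_multi`, `refs_to_multi`, and `changed_given_multi` correspond in spirit to the 22%, "about half", and 13% figures reported above, under the stated assumptions about how each denominator is defined.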