Engineering web cache consistency

Authors:
Jian Yin; Lorenzo Alvisi; Mike Dahlin; Arun Iyengar
Affiliations:
University of Texas at Austin, Austin, TX; University of Texas at Austin, Austin, TX; University of Texas at Austin, Austin, TX; IBM T. J. Watson Research Center, Yorktown Heights, NY
Venue:
ACM Transactions on Internet Technology (TOIT)
Year:
2002

Abstract

Server-driven consistency protocols can reduce read latency and improve data freshness for a given network and server overhead, compared with traditional consistency protocols that rely on client polling. Server-driven consistency protocols appear particularly attractive for large-scale dynamic Web workloads because dynamically generated data can change rapidly and unpredictably. However, there have been few reports on engineering server-driven consistency for such workloads. This article reports our experience in engineering server-driven consistency for a sporting and event Web site hosted by IBM, one of the most popular sites on the Internet for the duration of the event. We also examine an e-commerce site for a national retail store. Our study focuses on scalability and cachability of dynamic content. To assess scalability, we measure both the amount of state that a server needs to maintain to ensure consistency and the bursts of load in sending out invalidation messages when a popular object is modified. We find that server-driven protocols can cap the size of the server's state to a given amount without significant performance costs, and can smooth the bursts of load with minimal impact on the consistency guarantees. To improve performance, we systematically investigate several design issues for which prior research has suggested widely different solutions, including whether servers should send invalidations to idle clients. Finally, we quantify the performance impact of caching dynamic data with server-driven consistency protocols and the benefits of server-driven consistency protocols for large-scale dynamic Web services. We find that (i) caching dynamically generated data can increase cache hit rates by up to 10%, compared with systems that do not cache dynamically generated data; and (ii) server-driven consistency protocols can increase cache hit rates by a factor of 1.5 to 3 for large-scale dynamic Web services, compared with client polling protocols.
We have implemented a prototype of a server-driven consistency protocol based on our findings by augmenting the popular Squid cache.
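
The two scalability concerns named in the abstract, bounded server state and bursts of invalidation traffic, can be illustrated with a small sketch. This is not the protocol from the article or the Squid prototype; it is a toy model under stated assumptions. The class name `InvalidationServer`, the parameters `max_tracked` and `sends_per_tick`, and the choice to evict the oldest record first are all hypothetical, chosen only to show one way a server might cap its state and pace invalidation messages.

```python
from collections import OrderedDict, deque


class InvalidationServer:
    """Toy model of server-driven consistency (illustrative assumptions only)."""

    def __init__(self, max_tracked=1000, sends_per_tick=2):
        self.max_tracked = max_tracked        # assumed cap on (client, object) records
        self.sends_per_tick = sends_per_tick  # assumed pacing limit per tick
        self.callbacks = OrderedDict()        # (client, obj) records, oldest first
        self.pending = deque()                # queued (client, obj) invalidations

    def read(self, client, obj):
        """Record that client now caches obj; return any records evicted."""
        key = (client, obj)
        self.callbacks.pop(key, None)         # re-insert so the record becomes newest
        self.callbacks[key] = True
        evicted = []
        while len(self.callbacks) > self.max_tracked:
            old_key, _ = self.callbacks.popitem(last=False)
            # State is dropped only after queueing an invalidation, so the
            # client stops trusting a copy the server no longer tracks.
            self.pending.append(old_key)
            evicted.append(old_key)
        return evicted

    def write(self, obj):
        """obj was modified; queue an invalidation for every tracked copy."""
        stale = [key for key in self.callbacks if key[1] == obj]
        for key in stale:
            del self.callbacks[key]
            self.pending.append(key)
        return len(stale)

    def tick(self):
        """Send at most sends_per_tick queued invalidations; return those sent."""
        sent = []
        while self.pending and len(sent) < self.sends_per_tick:
            sent.append(self.pending.popleft())
        return sent


# Usage: three clients cache a popular object, then the object changes.
server = InvalidationServer(max_tracked=3, sends_per_tick=2)
for client in ["c1", "c2", "c3"]:
    server.read(client, "/scores")
queued = server.write("/scores")   # 3 invalidations queued
first_batch = server.tick()        # two are sent in this tick
second_batch = server.tick()       # the remaining one is sent in the next tick
```

Spreading the sends over several ticks trades a short delay in invalidation for a lower peak load, which is the kind of tradeoff the abstract describes as smoothing bursts with minimal impact on consistency guarantees.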