Transparent caching with strong consistency in dynamic content web sites

Authors:
Cristiana Amza;Gokul Soundararajan;Emmanuel Cecchet
Affiliations:
Toronto, Canada;Toronto, Canada;INRIA, Rhone-Alpes, France
Venue:
Proceedings of the 19th annual international conference on Supercomputing
Year:
2005

Abstract

We consider a cluster architecture in which dynamic content is generated by a database back-end and a collection of Web and application server front-ends. We study the effect of transparent query caching on the performance of such a cluster. Transparency requires that cached entries be invalidated as a result of writes. We start with a coarse-grain, table-level cache with automatic invalidation. Based on observed workload characteristics, we enhance the cache with the dependency tracking and invalidations needed at the finer granularity of columns. Finally, we reduce the miss penalty of invalidations through full and partial coverage of query results.

In terms of system design, a query cache may be located at the database back-end, on dedicated machines, on the front-ends, or on a combination thereof. This paper evaluates the tradeoffs of the different cache designs and cache locations using the TPC-W benchmark.

Our experiments show that our transparent query cache improves performance substantially: by up to a factor of 1.5 in throughput and 4.2 in response time overall, compared to the baseline table-based invalidation scheme. An important contributor to this result is our optimization for reducing the miss penalty through full and partial coverage detection of query results from the cache, which improves response time by up to a factor of 2.9 compared to a cache with fine-grained column-based invalidations alone. Thus, the benefits of the higher hit ratio in our optimizations outweigh the costs of additional processing. The results are less clear-cut in terms of where to locate the cache: performance differences when varying the cache location and the number of caches are small.
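
The difference between table-level and column-level invalidation can be illustrated with a minimal Python sketch. This is not the authors' implementation: the class `QueryCache`, its methods, the `(table, column)` dependency format, and the example queries are all hypothetical names chosen for illustration, and the sketch assumes the caller already knows which columns each query reads and each write modifies. It does not cover the full and partial coverage optimization.

```python
from dataclasses import dataclass, field


@dataclass
class QueryCache:
    """Toy query-result cache with write-driven invalidation (illustrative only)."""

    column_level: bool = False                    # False: table-level invalidation
    entries: dict = field(default_factory=dict)   # query text -> (result, dependencies)

    def put(self, query, result, reads):
        # `reads` is a set of (table, column) pairs the query depends on.
        # Assumption: a real system would extract these from the parsed query.
        self.entries[query] = (result, frozenset(reads))

    def get(self, query):
        hit = self.entries.get(query)
        return None if hit is None else hit[0]

    def invalidate(self, table, written_columns):
        """Drop every cached entry that a write to `table` may have made stale.

        Assumption: for an INSERT or DELETE the caller passes all columns of
        the table, since such a write can affect any query that reads it.
        """
        written = set(written_columns)
        stale = []
        for query, (_, reads) in self.entries.items():
            for read_table, read_column in reads:
                if read_table != table:
                    continue
                # Table-level: any read of the table is a conflict.
                # Column-level: only a read of a written column is a conflict.
                if not self.column_level or read_column in written:
                    stale.append(query)
                    break
        for query in stale:
            del self.entries[query]
        return stale


def demo(column_level):
    """Cache two queries on one table, apply one write, return surviving queries."""
    cache = QueryCache(column_level=column_level)
    cache.put("SELECT title FROM item WHERE id = 7", [("Dune",)],
              {("item", "title"), ("item", "id")})
    cache.put("SELECT stock FROM item WHERE id = 7", [(3,)],
              {("item", "stock"), ("item", "id")})
    # Models a write such as: UPDATE item SET stock = 2 WHERE id = 7
    cache.invalidate("item", ["stock"])
    return sorted(cache.entries)
```

In this sketch, `demo(False)` leaves the cache empty because the write touches the `item` table, while `demo(True)` keeps the `title` query because the write modified only the `stock` column. The finer granularity preserves more entries at the cost of tracking per-column dependencies, which is the tradeoff the abstract describes.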