A self-managing data cache for edge-of-network web applications

Authors:
Khalil Amiri;Sanghyun Park;Renu Tewari
Affiliations:
IBM T. J. Watson Research Center, Hawthorne, NY;Pohang University of Science and Technology, Pohang, Korea;IBM Almaden Research Center, San Jose, CA
Venue:
Proceedings of the eleventh international conference on Information and knowledge management
Year:
2002

Citing 19
Cited 6

String searching algorithms

String searching algorithms
Answering queries using views (extended abstract)

PODS '95 Proceedings of the fourteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Removal policies in network caches for World-Wide Web documents

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
Summary cache: a scalable wide-area Web cache sharing protocol

Proceedings of the ACM SIGCOMM '98 conference on Applications, technologies, architectures, and protocols for computer communication
Answering complex SQL queries using automatic summary tables

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
A middleware system which intelligently caches query results

IFIP/ACM International Conference on Distributed systems platforms
Generating efficient plans for queries using views

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Optimizing queries using materialized views: a practical, scalable solution

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Mid-tier caching: the TimesTen approach

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Middle-tier database caching for e-business

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Optimizing Queries with Materialized Views

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Materialized Views in Oracle

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
A Scalable Algorithm for Answering Queries Using Views

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Form-Based Proxy Caching for Database-Backed Web Sites

Proceedings of the 27th International Conference on Very Large Data Bases
Update Propagation Strategies for Improving the Quality of Data on the Web

Proceedings of the 27th International Conference on Very Large Data Bases
Semantic Data Caching and Replacement

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
A predicate-based caching scheme for client-server database architectures

The VLDB Journal — The International Journal on Very Large Data Bases
Design Considerations for Distributed Caching on the Internet

ICDCS '99 Proceedings of the 19th IEEE International Conference on Distributed Computing Systems
ICP and the Squid web cache

IEEE Journal on Selected Areas in Communications

Bypass Caching: Making Scientific Databases Good Network Citizens

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Load balancing and data placement for multi-tiered database systems

Data & Knowledge Engineering
Improving parallelism of federated query processing

Data & Knowledge Engineering
Schema-based cache validation of dynamic content to improve query performance of web services

Journal of Web Engineering
Performance-Enhanced Caching Scheme for Web Clusters for Dynamic Content

International Journal of Business Data Communications and Networking
Easy freshness with Pequod cache joins

NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Database caching at proxy servers enables dynamic content to be generated at the edge of the network, thereby improving the scalability and response time of web applications. The scale of deployment of edge servers coupled with the rising costs of their administration demand that such caching middleware be adaptive and self-managing. To achieve this, a cache must be dynamically populated and pruned based on the application query stream and access pattern. In this paper, we describe such a cache which maintains a large number of materialized views of previous query results. Cached "views" share physical storage to avoid redundancy, and are usually added and evicted dynamically to adapt to the current workload and to available resources. These two properties of large scale (large number of cached views) and overlapping storage introduce several challenges to query matching and storage management which are not addressed by traditional approaches. In this paper, we describe an edge data cache architecture with a flexible query matching algorithm and a novel storage management policy which work well in such an environment. We perform an evaluation of a prototype of such an architecture using the TPC-W benchmark and find that it reduces query response times by up to 75%, while reducing network and server load.