Cache investment: integrating query optimization and distributed data placement

  • Authors:
  • Donald Kossmann;Michael J. Franklin;Gerhard Drasch;Wig Ag

  • Affiliations:
  • Institut fur Informatik, Technical University of Munich, Orleanstr. 34, Munich, D-81667, Germany;University of California, Berkeley, Berkeley, CA;Munich, Germany;Munich, Germany

  • Venue:
  • ACM Transactions on Database Systems (TODS)
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Emerging distributed query-processing systems support flexible execution strategies in which each query can be run using a combination of data shipping and query shipping. As in any distributed environment, these systems can obtain tremendous performance and availability benefits by employing dynamic data caching. When flexible execution and dynamic caching are combined, however, a circular dependency arises: Caching occurs as a by-product of query operator placement, but query operator placement decisions are based on (cached) data location. The practical impact of this dependency is that query optimization decisions that appear valid on a per-query basis can actually cause suboptimal performance for all queries in the long run.To address this problem, we developed Cache Investment - a novel approach for integrating query optimization and data placement that looks beyond the performance of a single query. Cache Investment sometimes intentionally generates a “suboptimal” plan for a particular query in the interest of effecting a better data placement for subsequent queries. Cache Investment can be integrated into a distributed database system without changing the internals of the query optimizer. In this paper, we propose Cache Investment mechanisms and policies and analyze their performance. The analysis uses results from both an implementation on the SHORE storage manager and a detailed simulation model. Our results show that Cache Investment can significantly improve the overall performance of a system and demonstrate the trade-offs among various alternative policies.