A sample advisor for approximate query processing

  • Authors:
  • Philipp Rösch;Wolfgang Lehner

  • Affiliations:
  • SAP Research Center Dresden, Germany;Database Technology Group, Technische Universität Dresden, Germany

  • Venue:
  • ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The rapid growth of current data warehouse systems makes random sampling a crucial component of modern data management systems. Although there is a large body of work on database sampling, the problem of automatic sample selection remained (almost) unaddressed. In this paper, we tackle the problem with a sample advisor. We propose a cost model to evaluate a sample for a given query. Based on this, our sample advisor determines the optimal set of samples for a given set of queries specified by an expert. We further propose an extension to utilize recorded workload information. In this case, the sample advisor takes the set of queries and a given memory bound into account for the computation of a sample advice. Additionally, we consider the merge of samples in case of overlapping sample advice and present both an exact and a heuristic solution. Within our evaluation, we analyze the properties of the cost model and compare the proposed algorithms. We further demonstrate the effectiveness and the efficiency of the heuristic solutions with a variety of experiments.