Fast Approximate Answers to Aggregate Queries on a Data Cube

  • Authors:
  • Viswanath Poosala;Venkatesh Ganti

  • Affiliations:
  • -;-

  • Venue:
  • SSDBM '99 Proceedings of the 11th International Conference on Scientific and Statistical Database Management
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern decision support systems require very quick (interactive) responses from the DBMS, but pose complex queries on large volumes of data. In this paper, we present a novel solution to this problem: we precompute concise histogram statistics on the data to answer the queries quickly, but approximately. Our hypothesis is that many decision support applications can tolerate small errors in query results in return for large reductions in response times.In particular, we propose the use of multiple histograms to approximate the data cube and answer aggregate queries approximately using this summarized data. We enhance histograms to estimate the quality of the approximate answers. We primarily explore the interaction among various histograms on the data cube in order to minimize the space needed when an upper bound on the errors is given. Our main contribution in this paper is an efficient technique for selecting a provably near-optimal set of histograms on the data cube. Extensive experiments show that our technique results in very accurate and concise statistics.