Optimizing Selections over Datacubes

  • Authors:
  • Kenneth A. Ross;Kazi A. Zaman

  • Affiliations:
  • -;-

  • Venue:
  • SSDBM '00 Proceedings of the 12th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Datacube queries compute aggregates over database relations at a variety of granularities. Often one wants only datacube output tuples whose aggregate value satisfies a certain condition, such as exceeding a given threshold. We develop algorithms for processing a datacube query using the selection condition internally during the computation. Thus, we can safely prune parts of the computation and end up with a more efficient computation of the answer. Our first technique, called 驴specialization驴, uses the fact that a tuple in the datacube does not meet the given threshold to infer that not all finer level aggregates can meet the threshold. Our second technique is called 驴generalization驴, and applies in the case where the actual value of the aggregate is not needed in the output, but used just to compare with the threshold. We demonstrate the efficiency of these techniques by implementing them within the sparse datacube algorithm of Ross and Srivastava. We present a performance study using synthetic and real-world data sets. Our results indicate substantial performance improvements for queries with selective conditions.