Vmhist: Efficient Multidimensional Histograms with Improved Accuracy

  • Authors:
  • Pedro Furtado;Henrique Madeira

  • Affiliations:
  • -;-

  • Venue:
  • DaWaK 2000 Proceedings of the Second International Conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data warehouses must be able to process and analyze large amounts of information quickly and efficiently. Small summaries provide a very efficient way to obtain fast approximate answers to complex queries that run for too long. This paper proposes an efficient hierarchical partitioning strategy vmhist achieving a large improvement in the accuracy of the summary while maintaining all scalability. This is achieved by pre-computation, localized updating and additivity of the error measures used in the partitioning process. Evaluation reveals that a significant accuracy improvement is obtained for summaries produced with vmhist without significant increase in histogram construction time cost.