Efficient allocation algorithms for OLAP over imprecise data

  • Authors:
  • Doug Burdick;Prasad M. Deshpande;T. S. Jayram;Raghu Ramakrishnan;Shivakumar Vaithyanathan

  • Affiliations:
  • University of Wisconsin, Madison;IBM India Research Lab, SIRC;IBM Almaden Research Center;Yahoo! Research;IBM Almaden Research Center

  • Venue:
  • VLDB '06 Proceedings of the 32nd international conference on Very large data bases
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recent work proposed extending the OLAP data model to support data ambiguity, specifically imprecision and uncertainty. A process called allocation was proposed to transform a given imprecise fact table into a form, called the Extended Database, that can be readily used to answer OLAP aggregation queries.In this work, we present scalable, efficient algorithms for creating the Extended Database (i.e., performing allocation) for a given imprecise fact table. Many allocation policies require multiple iterations over the imprecise fact table, and the straightforward evaluation approaches introduced earlier can be highly inefficient. Optimizing iterative allocation policies for large datasets presents novel challenges, and has not been considered previously to the best of our knowledge. In addition to developing scalable allocation algorithms, we present a performance evaluation that demonstrates their efficiency and compares their performance with respect to straight-foward approaches.