Pre-aggregation with probability distributions

  • Authors:
  • Igor Timko;Curtis E. Dyreson;Torben Bach Pedersen

  • Affiliations:
  • Free University of Bozen-Bolzano;Washington State University;Aalborg University

  • Venue:
  • DOLAP '06 Proceedings of the 9th ACM international workshop on Data warehousing and OLAP
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Motivated by the increasing need to analyze complex, uncertain multidimensional data this paper proposes probabilistic OLAP queries that are computed using probability distributions rather than atomic values. The paper describes how to create probability distributions from base data, and how the distributions can be subsequently used in pre-aggregation. Since the probability distributions can become large, we show how to achieve good time and space efficiency by approximating the distributions. We present the results of several experiments that demonstrate the effectiveness of our methods. The work is motivatedwith a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. This paper is the first to consider the approximate processing of probabilistic OLAP queries over probability distributions.