Constructing an OLAP cube from distributed XML data

  • Authors:
  • Tapio Niemi;Marko Niinimäki;Jyrki Nummenmaa;Peter Thanisch

  • Affiliations:
  • Helsinki Institute of Physics, Genève, Switzerland;Helsinki Institute of Physics, Genève, Switzerland;University of Tampere, Finland;IBM UK, Edinburgh, Scotland

  • Venue:
  • Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

On-Line Analytical Processing (OLAP) is a powerful method for analysing large data warehouse data. Typically, the data for an OLAP database is collected from a set of data repositories such as e.g. operational databases. This data set is often huge, and it may not be known in advance what data is required and when to perform the desired data analysis tasks. Sometimes it may happen that some parts of the data are only needed occasionally. Therefore, keeping the OLAP database constantly up-to-date is not only a highly demanding task but it also may be overkill in practice.This suggests that in some applications it would be more feasible to form the OLAP cubes only when they are actually needed. We present such a system. As the data sources may well be heterogeneous, we propose an XML language for data collection. Our system also has a facility, where the user may pose a query against a "universal" OLAP cube using the MDX language. The query is analysed to determine which data is required for the desired OLAP cube.