A model-driven heuristic approach for detecting multidimensional facts in relational data sources

  • Authors:
  • Andrea Carmè;Jose-Norberto Mazón;Stefano Rizzi

  • Affiliations:
  • Iconsulting, Italy;Lucentia Research Group, Dept. of Software and Computing Systems, University of Alicante, Spain;DEIS, University of Bologna, Italy

  • Venue:
  • DaWaK'10 Proceedings of the 12th international conference on Data warehousing and knowledge discovery
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Facts are multidimensional concepts of primary interests for knowledge workers because they are related to events occurring dynamically in an organization. Normally, these concepts are modeled in operational data sources as tables. Thus, one of the main steps in conceptual design of a data warehouse is to detect the tables that model facts. However, this task may require a high level of expertise in the application domain, and is often tedious and time-consuming for designers. To overcome these problems, a comprehensive model-driven approach is presented in this paper to support designers in: (1) obtaining a CWM model of business-related relational tables, (2) determining which elements of this model can be considered as facts, and (3) deriving their counterparts in a multidimensional schema. Several heuristics -based on structural information derived from data sources- have been defined to this end and included in a set of Query/View/Transformation model transformations.