Integration and dimensional modeling approaches for complex data warehousing

  • Authors:
  • O. Boussaid;Adrian Tanasescu;Fadila Bentayeb;Jérôme Darmont

  • Affiliations:
  • ERIC, Université Lumière Lyon2, Bron Cedex, France 69676;LIRIS, Université Claude Bernard Lyon 1, Villeurbanne Cedex, France 69422;ERIC, Université Lumière Lyon2, Bron Cedex, France 69676;ERIC, Université Lumière Lyon2, Bron Cedex, France 69676

  • Venue:
  • Journal of Global Optimization
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the broad development of the World Wide Web, various kinds of heterogeneous data (including multimedia data) are now available to decision support tasks. A data warehousing approach is often adopted to prepare data for relevant analysis. Data integration and dimensional modeling indeed allow the creation of appropriate analysis contexts. However, the existing data warehousing tools are well-suited to classical, numerical data. They cannot handle complex data. In our approach, we adapt the three main phases of the data warehousing process to complex data. In this paper, we particularly focus on two main steps in complex data warehousing. The first step is data integration. We define a generic UML model that helps representing a wide range of complex data, including their possible semantic properties. Complex data are then stored in XML documents generated by a piece of software we designed. The second important phase we address is the preparation of data for dimensional modeling. We propose an approach that exploits data mining techniques to assist users in building relevant dimensional models.