Warehousing complex data from the web

  • Authors:
  • O. Boussaid;J. Darmont;F. Bentayeb;S. Loudcher

  • Affiliations:
  • ERIC, University of Lyon 2, 5 Avenue Pierre Mendes-France, 69676 Bron Cedex, France.;ERIC, University of Lyon 2, 5 Avenue Pierre Mendes-France, 69676 Bron Cedex, France.;ERIC, University of Lyon 2, 5 Avenue Pierre Mendes-France, 69676 Bron Cedex, France.;ERIC, University of Lyon 2, 5 Avenue Pierre Mendes-France, 69676 Bron Cedex, France

  • Venue:
  • International Journal of Web Engineering and Technology
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data warehousing and Online Analytical Processing (OLAP)technologies are now moving onto handling complex data that mostlyoriginate from the web. However, integrating such data into adecision-support process requires their representation in a formprocessable by OLAP and/or data mining techniques. We present inthis paper a complex data warehousing methodology that exploitseXtensible Markup Language (XML) as a pivot language. Our approachincludes the integration of complex data in an ODS, in the form ofXML documents; their dimensional modelling and storage in an XMLdata warehouse; and their analysis with combined OLAP and datamining techniques. We also address the crucial issue of performancein XML warehouses.