Multidimensional database design from document-centric XML documents

  • Authors:
  • Geneviève Pujolle;Franck Ravat;Olivier Teste;Ronan Tournier;Gilles Zurfluh

  • Affiliations:
  • Université de Toulouse, Toulouse 1 Capitole, IRIT (UMR5505), Toulouse Cedex, France;Université de Toulouse, Toulouse 1 Capitole, IRIT (UMR5505), Toulouse Cedex, France;Toulouse 3 Paul Sabatier, IRIT (UMR5505), Toulouse Cedex, France;Université de Toulouse, Toulouse 1 Capitole, IRIT (UMR5505), Toulouse Cedex, France;Université de Toulouse, Toulouse 1 Capitole, IRIT (UMR5505), Toulouse Cedex, France

  • Venue:
  • DaWaK'11: Proceedings of the 13th International Conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2011

Abstract

Despite a decade of research in OLAP systems, very few works attempt to tackle the problem of analysing data extracted from text-rich XML documents. These are loosely structured XML documents composed mainly of text. This paper details the conceptual design steps for building multidimensional databases from such documents. Using an adapted multidimensional conceptual model, the design process allows data extracted from text-rich XML documents to be integrated into an adapted OLAP system.