A relational data harmonization approach to XML

  • Authors:
  • Timo Niemi;Turkka Näppilä;Kalervo Järvelin

  • Affiliations:
  • Department of Computer Sciences, FI-33014, University of Tampere, Finland;Department of Computer Sciences, FI-33014, University of Tampere, Finland;Department of Information Studies, FI-33014, University of Tampere, Finland

  • Venue:
  • Journal of Information Science
  • Year:
  • 2009

Abstract

There are numerous approaches for integrating data from heterogeneous data sources. A common background assumption is that the data sources remain quite stable and are known in advance, so an integration system can be built to manipulate them. In practice, however, there is often a demand for supporting ad hoc information needs that concern unexpected, autonomous data sources containing volatile data. A different approach is therefore needed. We propose that semantically similar data be harmonized when they are extracted from XML-based data sources. We introduce a constructor algebra, which is a powerful tool in the harmonization of XML data. For any XML data source, this algebra can form a unique relational representation, called an XML relation. We demonstrate that the XML relation representation supports the grouping and aggregation of data needed, for example, in OLAP-style (online analytical processing) applications.
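
The general idea of viewing XML data relationally and then grouping and aggregating it can be illustrated with a minimal sketch. This is not the constructor algebra or the XML relation defined in the paper, whose details the abstract does not give. It is a generic flattening under stated assumptions: the sample document, its element names, and the helper functions `flatten` and `total_by_region` are all hypothetical. Each text-bearing leaf element becomes one row holding its path, its position in the tree, and its value.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample source; the element names are invented for illustration.
XML_SOURCE = """
<sales>
  <sale><region>North</region><amount>120</amount></sale>
  <sale><region>South</region><amount>80</amount></sale>
  <sale><region>North</region><amount>30</amount></sale>
</sales>
"""


def flatten(xml_text):
    """Return one (path, position, value) row per leaf element.

    This is a generic flattening, NOT the paper's XML relation.
    `position` is a tuple of 1-based child indexes from the root down.
    """
    root = ET.fromstring(xml_text)
    rows = []

    def walk(elem, path, position):
        children = list(elem)
        if not children:
            rows.append((path, position, (elem.text or "").strip()))
        for i, child in enumerate(children, start=1):
            walk(child, f"{path}/{child.tag}", position + (i,))

    walk(root, f"/{root.tag}", (1,))
    return rows


def total_by_region(rows):
    """Group leaf rows by their parent element, then sum amounts per region."""
    records = defaultdict(dict)
    for path, position, value in rows:
        field = path.rsplit("/", 1)[-1]
        # Leaves that share a parent position belong to the same record.
        records[position[:-1]][field] = value

    totals = defaultdict(int)
    for record in records.values():
        totals[record["region"]] += int(record["amount"])
    return dict(totals)


if __name__ == "__main__":
    print(total_by_region(flatten(XML_SOURCE)))
```

Keeping the position tuple alongside each path is what lets sibling leaves be reassembled into records before aggregation. A harmonization step in the sense of the abstract would additionally have to map differently named but semantically similar elements from several sources onto common names, which this sketch does not attempt.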