Two approaches to the integration of heterogeneous data warehouses

  • Authors:
  • Riccardo Torlone

  • Affiliations:
  • Dipartimento di Informatica e Automazione, Università Roma Tre, Roma, Italy

  • Venue:
  • Distributed and Parallel Databases
  • Year:
  • 2008

Abstract

In this paper we address the problem of integrating independent and possibly heterogeneous data warehouses, a problem that has received little attention so far but that arises very often in practice. We start by tackling the basic issue of matching heterogeneous dimensions and provide a number of general properties that a dimension matching should fulfill. We then propose two different approaches to the problem of integration that try to enforce matchings satisfying these properties. The first approach refers to a scenario of loosely coupled integration, in which we just need to identify the common information between data sources and perform join operations over the original sources. The goal of the second approach is the derivation of a materialized view built by merging the sources; it refers to a scenario of tightly coupled integration, in which queries are performed against the view. We also illustrate the architecture and functionality of a practical system that we have developed to demonstrate the effectiveness of our integration strategies.
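
The contrast between the two scenarios can be made concrete with a toy sketch. Everything below is an assumption made for illustration: the two small sources, the city-to-country roll-up, and the function names (`common_members`, `loosely_coupled_join`, `merge_dimensions`, `materialized_view`) are hypothetical and are not the matching properties or integration operators defined in the paper. The sketch only shows the general idea of joining over shared dimension members versus querying a view built from merged sources.

```python
# Toy illustration only: data, names, and functions are hypothetical and do
# not reproduce the paper's dimension matching or integration algorithms.

# Each source has a dimension (city -> country roll-up) and a fact table
# (city -> sales).
dim_a = {"Rome": "Italy", "Milan": "Italy", "Paris": "France"}
dim_b = {"Rome": "Italy", "Paris": "France", "Lyon": "France"}
sales_a = {"Rome": 10, "Milan": 5, "Paris": 7}
sales_b = {"Rome": 3, "Paris": 4, "Lyon": 6}


def common_members(d1, d2):
    """Members present in both dimensions with the same roll-up."""
    return {m for m in d1.keys() & d2.keys() if d1[m] == d2[m]}


def loosely_coupled_join(f1, f2, shared):
    """Join the original fact tables over the shared members only."""
    return {m: (f1[m], f2[m]) for m in sorted(shared) if m in f1 and m in f2}


def merge_dimensions(d1, d2):
    """Union of two dimensions; fails if they disagree on a roll-up."""
    merged = dict(d1)
    for member, parent in d2.items():
        if merged.setdefault(member, parent) != parent:
            raise ValueError(f"conflicting roll-up for {member!r}")
    return merged


def materialized_view(f1, f2, merged_dim):
    """Materialize per-country sales totals over the merged dimension."""
    view = {}
    for facts in (f1, f2):
        for member, value in facts.items():
            country = merged_dim[member]
            view[country] = view.get(country, 0) + value
    return view


# Loosely coupled: identify common information, then join the sources.
shared = common_members(dim_a, dim_b)
joined = loosely_coupled_join(sales_a, sales_b, shared)

# Tightly coupled: merge the sources and query the materialized view.
merged = merge_dimensions(dim_a, dim_b)
view = materialized_view(sales_a, sales_b, merged)
```

In this sketch the loosely coupled path leaves both sources untouched and only relates the cities they share, while the tightly coupled path produces a single merged structure that later queries would read instead of the sources.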