Accommodating instance heterogeneities in database integration

  • Authors:
  • Ee-Peng Lim;Roger H. L. Chiang

  • Affiliations:
  • Center for Advanced Information Systems, School of Computer Engineering, Nanyang Technological University, N4-2a-32, Nanyang Avenue, Singapore 639798, Singapore;Information Systems Department, College of Business, University of Cincinnati, Cincinnati, OH

  • Venue:
  • Decision Support Systems
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

A complete data integration solution can be viewed as an iterative process that consists of three phases, namely analysis, derivation and evolution. The entire process is similar to a software development process with the target application being the derivation roles for the integrated databases. In many cases, data integration requires several iterations of refining the local-to-global database mapping rules before a stable set of rules can be obtained. In particular, the mapping rules, as well as the data model and query model for the integrated databases have to cope with poor data quality in local databases, ongoing local database updates and instance heterogeneities. In this paper, we therefore propose a new object-oriented global data model, known as OORA, that can accommodate attribute and relationship instance heterogeneities in the integrated databases. The OORA model has been designed to allow database integrators and end users to query both the local and resolved instance values using the same query language throughout the derivation and evolution phases of database integration. Coupled with the OORA model, we also define a set of local-to-global database mapping rules that can detect new heterogeneities among databases and resolve instance heterogeneities if situations permit.