On Using Historical Update Information for Instance Identification in Federated Databases

  • Authors:
  • Antonio Si;Chi C. Ying;Dennis McLeod

  • Affiliations:
  • -;-;-

  • Venue:
  • COOPIS '96 Proceedings of the First IFCIS International Conference on Cooperative Information Systems
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

To support database interoperability in federated databases systems, it is critical to be able to identify (potentially) equivalent data instances from individual autonomous database components. Since the components in a federation are autonomous, their data may be updated asynchronously, viz., modifications to a real world entity may be captured in different databases at different times; we term this effect update heterogeneity. Existing approaches largely base data instance similarity identification only on current attribute/property values; in the face of update heterogeneity, this is inadequate. In this paper, we present an approach to address the problem of update heterogeneity in the federated databases context. We employ a probablistic model, which utilizes historical database update information to estimate the degree of similarity between candidate data instances from different database components. We employ transaction history (log) information to this end, which is typically already available in the component database systems. We have experimentally implemented and tested this approach within the context of a prototype experimental federated databases system FeXpress.