Matching Attributes across Overlapping Heterogeneous Data Sources Using Mutual Information

  • Authors:
  • Huimin Zhao

  • Affiliations:
  • University of Wisconsin-Milwaukee, USA

  • Venue:
  • Journal of Database Management
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Identifying matching attributes across heterogeneous data sources is a critical and time-consuming step in integrating the data sources. In this paper, the author proposes a method for matching the most frequently encountered types of attributes across overlapping heterogeneous data sources. The author uses mutual information as a unified measure of dependence on various types of attributes. An example is used to demonstrate the utility of the proposed method, which is useful in developing practical attribute matching tools.