Managing Uncertainty in Schema Matcher Ensembles

  • Authors:
  • Anan Marie;Avigdor Gal

  • Affiliations:
  • Technion --- Israel Institute of Technology,;Technion --- Israel Institute of Technology,

  • Venue:
  • SUM '07 Proceedings of the 1st international conference on Scalable Uncertainty Management
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Schema matching is the task of matching between concepts describing the meaning of data in various heterogeneous, distributed data sources. With many heuristics to choose from, several tools have enabled the use of schema matcher ensembles, combining principles by which different schema matchers judge the similarity between concepts. In this work, we investigate means of estimating the uncertainty involved in schema matching and harnessing it to improve an ensemble outcome. We propose a model for schema matching, based on simple probabilistic principles. We then propose the use of machine learning in determining the best mapping and discuss its pros and cons. Finally, we provide a thorough empirical analysis, using both real-world and synthetic data, to test the proposed technique. We conclude that the proposed heuristic performs well, given an accurate modeling of uncertainty in matcher decision making.