Machine learning models: combining evidence of similarity for XML schema matching

  • Authors:
  • Tran Hong-Minh;Dan Smith

  • Affiliations:
  • School of Computing Sciences, University of Of East Anglia, Norwich, UK;School of Computing Sciences, University of Of East Anglia, Norwich, UK

  • Venue:
  • KDXD'06 Proceedings of the First international conference on Knowledge Discovery from XML Documents
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Matching schemas at an element level or structural level is generally categorized as either hybrid, which uses one algorithm, or composite, which combines evidence from several different matching algorithms for the final similarity measure. We present an approach for combining element-level evidence of similarity for matching XML schemas with a composite approach. By combining high recall algorithms in a composite system we reduce the number of real matches missed. By performing experiments on a number of machine learning models for combination of evidence in a composite approach and choosing the SMO for the high precision and recall, we increase the reliability of the final matching results. The precision is therefore enhanced (e.g., with data sets used by Cupid and suggested by the author of LSD, our precision is respectively 13.05% and 31.55% higher than COMA and Cupid on average).