XML schema clustering with semantic and hierarchical similarity measures

  • Authors:
  • Richi Nayak;Wina Iryadi

  • Affiliations:
  • School of Information Systems, Queensland University of Technology, Brisbane, Qld., 4001, Australia;School of Information Systems, Queensland University of Technology, Brisbane, Qld., 4001, Australia

  • Venue:
  • Knowledge-Based Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the growing popularity of XML as the data representation language, collections of the XML data are exploded in numbers. The methods are required to manage and discover the useful information from them for the improved document handling. We present a schema clustering process by organising the heterogeneous XML schemas into various groups. The methodology considers not only the linguistic and the context of the elements but also the hierarchical structural similarity. We support our findings with experiments and analysis.