XMine: a methodology for mining XML structure

  • Authors:
  • Richi Nayak; Wina Iryadi

  • Affiliations:
  • School of Information Systems, Queensland University of Technology, Brisbane, Australia; School of Information Systems, Queensland University of Technology, Brisbane, Australia

  • Venue:
  • APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
  • Year:
  • 2006

Abstract

XML has become a standard for information exchange and retrieval on the Web. This paper presents the XMine methodology, which groups heterogeneous XML documents into separate meaningful classes by considering both linguistic and hierarchical structure similarity. The empirical results demonstrate that the semantic and syntactic relationships of elements, and the context given by their path names, play an important role in producing good-quality clusters.
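
To give a concrete sense of what "path names context" can mean, the following is a minimal sketch, not the XMine method itself. It assumes each document is summarised by its set of root-to-element tag paths and compares two documents by Jaccard overlap of those sets. The function names (element_paths, path_similarity), the sample documents, and the choice of Jaccard overlap are illustrative assumptions; the abstract does not specify the similarity measure, and the linguistic matching of element names and the clustering step are omitted here.

```python
import xml.etree.ElementTree as ET


def element_paths(xml_text):
    """Return the set of root-to-element tag paths, e.g. 'book/author/name'."""
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        # Extend the parent's path with this element's tag name.
        path = f"{prefix}/{node.tag}" if prefix else node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def path_similarity(xml_a, xml_b):
    """Jaccard overlap of two path sets (an illustrative measure only)."""
    a, b = element_paths(xml_a), element_paths(xml_b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Hypothetical sample documents, invented for this sketch.
doc1 = "<book><title/><author><name/></author></book>"
doc2 = "<book><title/><author><name/></author><year/></book>"
doc3 = "<order><item><price/></item></order>"

print(path_similarity(doc1, doc2))  # high overlap: similar hierarchies
print(path_similarity(doc1, doc3))  # no shared paths
```

Exact path matching of this kind treats differently named but related tags (for example "author" and "writer") as unrelated, which is the gap that a linguistic similarity component, as described in the abstract, would be expected to address.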