Similarity Measurement of XML Documents Based on Structure and Contents

  • Authors:
  • Tae-Soon Kim;Ju-Hong Lee;Jae-Won Song;Deok-Hwan Kim

  • Affiliations:
  • Dept. of Computer Science & Information Engineering, Inha University, Incheon, Korea;Dept. of Computer Science & Information Engineering, Inha University, Incheon, Korea;Dept. of Computer Science & Information Engineering, Inha University, Incheon, Korea;Dept. of Electronics Engineering, Inha University,

  • Venue:
  • ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Researches on the similarity measure between XML documents are being progressed in order to effectively control and retrieve various XML documents. Previous works mostly suggest similarity-measuring methods focusing only on the tag structure of XML documents. However, they have a problem of incorrectly calculating the semantic similarity of XML contents. In this paper, we propose a new similarity measurement method considering not only the structural information of tags in XML documents but also the semantic information of tags and text content information related with the tags. Our experiments demonstrate that our proposed method improves the accuracy of similarity, compared to the previous works.