Measuring similarity of semi-structured documents with context weights

  • Authors:
  • Christopher C. Yang;Nan Liu

  • Affiliations:
  • The Chinese University of Hong Kong;The Chinese University of Hong Kong

  • Venue:
  • SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2006

Quantified Score

Hi-index 0.02

Visualization

Abstract

In this work, we study similarity measures for text-centric XML documents based on an extended vector space model, which considers both document content and structure. Experimental results based on a benchmark showed superior performance of the proposed measure over the baseline which ignores structural knowledge of XML documents.