XML Document Classification Using Extended VSM

  • Authors:
  • Jianwu Yang;Fudong Zhang

  • Affiliations:
  • Institute of Computer Sci. & Tech., Peking University, Beijing, China 100871;Institute of Computer Sci. & Tech., Peking University, Beijing, China 100871

  • Venue:
  • Focused Access to XML Documents
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Structured link vector model (SLVM) is a representation recently proposed for modeling XML documents, which was extended from the conventional vector space model (VSM) by incorporating document structures. In this paper, we describe the classification approach for XML documents based on SLVM and Support Vector Machine (SVM) in INEX 2007 Document Mining Challenge. The experimental results on the challenge's data set show that it outperforms any other approach on XML document classification task at the challenge.