PKU at INEX 2010 XML mining track

  • Authors:
  • Songlin Wang;Feng Liang;Jianwu Yang

  • Affiliations:
  • Institute of Computer Sci. & Tech., Peking University, Beijing, China;Institute of Computer Sci. & Tech., Peking University, Beijing, China;Institute of Computer Sci. & Tech., Peking University, Beijing, China

  • Venue:
  • INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents our participation in the INEX 2010 XML Mining track. Our classification and clustering solutions for XML documents have used both the structure and content information, where the frequent subtrees as structural units are used for content extraction from the XML document. In addition, we used the WordNet and the link information for better performance, and applied the structured link vector model for classification.