Clustering with random indexing K-tree and XML structure

  • Authors:
  • Christopher M. De Vries;Shlomo Geva;Lance De Vine

  • Affiliations:
  • Faculty of Science and Technology, Queensland University of Technology, Brisbane, Australia;Faculty of Science and Technology, Queensland University of Technology, Brisbane, Australia;Faculty of Science and Technology, Queensland University of Technology, Brisbane, Australia

  • Venue:
  • INEX'09 Proceedings of the Focused retrieval and evaluation, and 8th international conference on Initiative for the evaluation of XML retrieval
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes the approach taken to the clustering task at INEX 2009 by a group at the Queensland University of Technology. The Random Indexing (RI) K-tree has been used with a representation that is based on the semantic markup available in the INEX 2009 Wikipedia collection. The RI K-tree is a scalable approach to clustering large document collections. This approach has produced quality clustering when evaluated using two different methodologies.