Hybrid Neural Document Clustering Using Guided Self-Organization and WordNet

  • Authors:
  • Chihli Hung;Stefan Wermter;Peter Smith

  • Affiliations:
  • -;-;-

  • Venue:
  • IEEE Intelligent Systems
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Document clustering is usually performed under the assumption that classification knowledge is unavailable; document classification, however, uses a classified data set for training. The supervised classification approach often achieves greater accuracy than the unsupervised clustering method. If the corpus of documents offers topical categorization, however, clustering can potentially exploit this domain knowledge by moving from an unsupervised to a partially supervised, guided self-organization. In this case, using a neural guided self-organizing network as a metaclassifier on category information offers the opportunity to exploit the domain knowledge. The authors introduce a novel combination of bottom-up dynamic neural learning with top-down symbolic WordNet processing. They show that the hypernym semantic relationship in WordNet complements the neural model, improving classification accuracy and clustering analysis.