Semantic mapping and K-means applied to hybrid SOM-based document organization system construction

  • Authors:
  • Renato Fernandes Corrêa;Teresa Bernarda Ludermir

  • Affiliations:
  • Cidade Universitária Recife -- PE, Brazil;Cidade Universitária Recife -- PE, Brazil

  • Venue:
  • Proceedings of the 2008 ACM symposium on Applied computing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present and evaluate a hybrid document organization system based on Self-Organizing Maps. The proposed system uses Semantic Mapping to dimensionality reduction and K-means to volume reduction of document vectors of a medium text collection. The vectors obtained after dimensionality and volume reduction steps are used to train the document maps with the SOM algorithm, thus the training time is reduced without compromising the quality of the generated map. We compare experimentally the hybrid system with the correspondent SOM system in organization of documents of Reuters-21758 v1.0 collection. The performances of the systems were measured in terms of classification error in text categorization and training time. The experimental results show that the proposed system generates pretty good document maps with smallest training time.