Research on Text Clustering Algorithm Based on K_means and SOM

  • Authors:
  • Li Xinwu

  • Affiliations:
  • -

  • Venue:
  • IITAW '08 Proceedings of the 2008 International Symposium on Intelligent Information Technology Application Workshops
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Text clustering is one of the difficult and hot research fields in the internet search engine research. Combination the advantages of K-means clustering and Self-Organizing Model (SOM) techniques, a new text clustering algorithm is presented. Firstly, texts are preprocessed to satisfy succeed process. Then, the paper analyzes common K-means clustering algorithm and SOM algorithm and combines them to overcome efficiency of low stability of K-means algorithm which is very sensitive to the initial cluster center and the isolated point text. The experimental results indicate that the improved algorithm has a higher accuracy and has a better stability, compared with the original algorithm.