Concept features extraction and text clustering analysis of neural networks based on cognitive mechanism

  • Authors:
  • Lin Wang;Minghu Jiang;Shasha Liao;Beixing Deng;Chengqing Zong;Yinghua Lu

  • Affiliations:
  • School of Electronic Eng., Beijing Univ. of Post and Telecom, Beijing, China;School of Electronic Eng., Beijing Univ. of Post and Telecom, Beijing, China;Lab of Computational Linguistics, School of Humanities and Social Sciences, Tsinghua University, Beijing, China;Dept. of Electronic Eng., Tsinghua University, Beijing, China;State Key Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Science, Beijing, China;School of Electronic Eng., Beijing Univ. of Post and Telecom, Beijing, China

  • Venue:
  • ICIC'06 Proceedings of the 2006 international conference on Intelligent Computing - Volume Part I
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The feature selection is an important part in automatic classification. In this paper, we use the HowNet to extract the concept attributes, and propose CHI-MCOR method to build a feature set. This method not only selects the highly occurring words, but also selects the word whose occurrence frequency is middle or low occurring words that are important for text classification. The combined method is much better than any one of the weight methods. Then we use the Self-Organizing Map (SOM) to realize automatic text clustering. The experiment result shows that if we can extract the sememes properly, we can not only reduce the feature dimension but also improve the classification precise. SOM can be used in text clustering in large scales and the clustering results are good when the concept feature is selected.