Text clustering algorithm based on spectral graph seriation

  • Authors:
  • Guo Wensheng; Li Guohe

  • Affiliations:
  • Department of Computer Science and Technology, China University of Petroleum-Beijing, Beijing, Changping and Key Laboratory of Earth Prospecting and Information Technology Beijing, Beijing, Changp ...;Department of Computer Science and Technology, China University of Petroleum-Beijing, Beijing, Changping

  • Venue:
  • CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
  • Year:
  • 2009

Abstract

In information processing, most existing text clustering algorithms are based on the Vector Space Model (VSM). However, VSM cannot effectively express the structure of a text, and therefore cannot fully capture its semantic information. To improve the expression of semantic information, this paper presents a new text structure graph model. Using a weighted graph, the model represents the characteristic terms of a text together with their associated location information. On the basis of spectral graph seriation, a spectral clustering algorithm is then proposed. The algorithm replaces the search for common subgraphs with matrix computation, thereby reducing the computational complexity of graph clustering. The paper also includes an analysis of the algorithm and experiments. The results show that the text clustering algorithm based on spectral graph seriation is effective and feasible.
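
The abstract does not give the construction itself, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes that "location information" can be approximated by counting how often two terms occur within a small window of each other, and it uses one common form of spectral seriation: ordering graph nodes by their component in the leading eigenvector of the weighted adjacency matrix. The function names (`build_term_graph`, `leading_eigenvector`, `seriate`) and the `window` parameter are hypothetical.

```python
def build_term_graph(tokens, window=1):
    """Assumed model: an edge weight counts how often two distinct
    terms occur within `window` positions of each other."""
    terms = sorted(set(tokens))
    index = {t: i for i, t in enumerate(terms)}
    n = len(terms)
    adj = [[0.0] * n for _ in range(n)]
    for pos, term in enumerate(tokens):
        for other in tokens[pos + 1 : pos + 1 + window]:
            if other != term:
                i, j = index[term], index[other]
                adj[i][j] += 1.0
                adj[j][i] += 1.0  # keep the matrix symmetric
    return terms, adj


def leading_eigenvector(adj, iterations=200):
    """Power iteration on A + I. The identity shift leaves the
    eigenvectors unchanged but avoids oscillation on bipartite graphs."""
    n = len(adj)
    vec = [1.0] * n
    for _ in range(iterations):
        nxt = [sum(adj[i][j] * vec[j] for j in range(n)) + vec[i]
               for i in range(n)]
        norm = sum(x * x for x in nxt) ** 0.5
        vec = [x / norm for x in nxt]
    return vec


def seriate(tokens, window=1):
    """Order terms by decreasing leading-eigenvector component, turning
    the graph into a sequence using only matrix computation."""
    terms, adj = build_term_graph(tokens, window)
    vec = leading_eigenvector(adj)
    position = {t: i for i, t in enumerate(terms)}
    return sorted(terms, key=lambda t: -vec[position[t]])


if __name__ == "__main__":
    print(seriate("a b a c a d".split()))
```

Under these assumptions, two texts could then be compared through their serialized term orders (for example with a string distance) instead of a common-subgraph search, which is the kind of saving the abstract describes.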