Text data clustering by contextual graphs

  • Authors:
  • Krzysztof Ciesielski;Mieczysław A. Kłopotek

  • Affiliations:
  • Institute of Computer Science, Polish Academy of Sciences, Warszawa, Poland;Institute of Computer Science, Polish Academy of Sciences, Warszawa, Poland

  • Venue:
  • DS'06 Proceedings of the 9th international conference on Discovery Science
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we focus on the class of graph-based clustering models, such as growing neural gas or idiotypic nets for the purpose of high-dimensional text data clustering. We present a novel approach, which does not require operation on the complex overall graph of clusters, but rather allows to shift majority of effort to context-sensitive, local subgraph and local sub-space processing. Savings of orders of magnitude in processing time and memory can be achieved, while the quality of clusters is improved, as presented experiments demonstrate.