A new label maximization based incremental neural clustering approach: application to text clustering

  • Authors:
  • Jean-Charles Lamirel;Raghvendra Mall;Shadi Al Shehabi;Ghada Safi

  • Affiliations:
  • LORIA, Campus Scientifique, Vandoeuvre-lès-Nancy, France;Center of Data Engineering, IIIT Hyderabad, Hyderabad, Andhra Pradesh, India;Department of Mathematics, Faculty of Science, Aleppo University, Aleppo, Syria;Department of Mathematics, Faculty of Science, Aleppo University, Aleppo, Syria

  • Venue:
  • WSOM'11 Proceedings of the 8th international conference on Advances in self-organizing maps
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Neural clustering algorithms show high performance in the general context of the analysis of homogeneous textual dataset. This is especially true for the recent adaptive versions of these algorithms, like the incremental growing neural gas algorithm (IGNG) and the label maximization based incremental growing neural gas algorithm (IGNG-F). In this paper we highlight that there is a drastic decrease of performance of these algorithms, as well as the one of more classical algorithms, when a heterogeneous textual dataset is considered as an input. Specific quality measures and cluster labeling techniques that are independent of the clustering method are used for the precise performance evaluation. We provide variations to incremental growing neural gas algorithm exploiting in an incremental way knowledge from clusters about their current labeling along with cluster distance measure data. This solution leads to significant gain in performance for all types of datasets, especially for the clustering of complex heterogeneous textual data.