A stability based validity method for fuzzy clustering

  • Authors:
  • M. Falasconi;A. Gutierrez;M. Pardo;G. Sberveglieri;S. Marco

  • Affiliations:
  • SENSOR Laboratory, Department of Chemistry and Physics for Engineering and Materials, University of Brescia and INFM-CNR, Via Valotti 9, I-25123 Brescia, Italy;Departament d'Electrònica, Universitat de Barcelona, Martí i Franquès, 1, 08028 Barcelona, Spain and Artificial Olfaction Group, Institute for Bioengineering of Catalonia (IBEC), Ba ...;SENSOR Laboratory, Department of Chemistry and Physics for Engineering and Materials, University of Brescia and INFM-CNR, Via Valotti 9, I-25123 Brescia, Italy and Computational Molecular Biology ...;SENSOR Laboratory, Department of Chemistry and Physics for Engineering and Materials, University of Brescia and INFM-CNR, Via Valotti 9, I-25123 Brescia, Italy;Departament d'Electrònica, Universitat de Barcelona, Martí i Franquès, 1, 08028 Barcelona, Spain and Artificial Olfaction Group, Institute for Bioengineering of Catalonia (IBEC), Ba ...

  • Venue:
  • Pattern Recognition
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

An important goal in cluster analysis is the internal validation of results using an objective criterion. Of particular relevance in this respect is the estimation of the optimum number of clusters capturing the intrinsic structure of your data. This paper proposes a method to determine this optimum number based on the evaluation of fuzzy partition stability under bootstrap resampling. The method is first characterized on synthetic data with respect to hyper-parameters, like the fuzzifier, and spatial clustering parameters, such as feature space dimensionality, clusters degree of overlap, and number of clusters. The method is then validated on experimental datasets. Furthermore, the performance of the proposed method is compared to that obtained using a number of traditional fuzzy validity rules based on the cluster compactness-to-separation criteria. The proposed method provides accurate and reliable results, and offers better generalization capabilities than the classical approaches.