Quality indices for (practical) clustering evaluation

  • Authors:
  • Margarida G. M. S. Cardoso;André Ponce de Leon F. de Carvalho

  • Affiliations:
  • (Correspd. E-mail: margarida.cardoso@iscte.pt) Department of Quantitative Methods, ISCTE Business School, Av. das Forças Armadas, 1649-026, Lisboa, Portugal;Department of Computer Science, Institute of Mathematics and Computer Science, University of São Paulo, Av. Trabalhador Sãocarlense, 400, CEP 13560-970, São Carlos, SP, Brazil

  • Venue:
  • Intelligent Data Analysis
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering quality or validation indices allow the evaluation of the quality of clustering in order to support the selection of a specific partition or clustering structure in its natural unsupervised environment, where the real solution is unknown or not available. In this paper, we investigate the use of quality indices mostly based on the concepts of clusters' compactness and separation, for the evaluation of clustering results (partitions in particular). This work intends to offer a general perspective regarding the appropriate use of quality indices for the purpose of clustering evaluation. After presenting some commonly used indices, as well as indices recently proposed in the literature, key issues regarding the practical use of quality indices are addressed. A general methodological approach is presented which considers the identification of appropriate indices thresholds. This general approach is compared with the simple use of quality indices for evaluating a clustering solution.