Cluster validity methods: part I

  • Authors: Maria Halkidi; Yannis Batistakis; Michalis Vazirgiannis

  • Affiliations: Athens University of Economics & Business (all authors)

  • Venue: ACM SIGMOD Record
  • Year: 2002

Abstract

Clustering is an unsupervised process: there are no predefined classes and no examples that indicate which groupings are present in the data set. Most clustering algorithms behave differently depending on the features of the data set and on the initial assumptions used to define groups. Therefore, in most applications the resulting clustering scheme requires some evaluation of its validity. Evaluating and assessing the results of a clustering algorithm is the main subject of cluster validity. In this paper we present a review of cluster validity methods. More specifically, Part I of the paper discusses the cluster validity approaches based on external and internal criteria.
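
To make the notion of an external criterion concrete, the sketch below compares a clustering result against a predefined partition of the same points using the Rand statistic, a standard pair-counting measure. It is only an illustration under stated assumptions: the function name `rand_statistic` and the toy label lists are hypothetical, and the abstract itself does not say which indices the paper covers. An internal criterion, by contrast, would be computed from the data alone (for example from its proximity matrix), with no reference partition.

```python
from itertools import combinations


def rand_statistic(labels_pred, labels_true):
    """Fraction of point pairs on which two partitions agree.

    A pair counts as an agreement when both partitions place the two
    points in the same group, or both place them in different groups.
    (Hypothetical helper, written for illustration only.)
    """
    if len(labels_pred) != len(labels_true):
        raise ValueError("label sequences must have the same length")
    if len(labels_pred) < 2:
        raise ValueError("need at least two points to form a pair")

    agree = 0
    total = 0
    for i, j in combinations(range(len(labels_pred)), 2):
        same_pred = labels_pred[i] == labels_pred[j]
        same_true = labels_true[i] == labels_true[j]
        if same_pred == same_true:
            agree += 1
        total += 1
    return agree / total


# Toy data (assumed): the label names differ, but the grouping is identical.
identical = rand_statistic([0, 0, 1, 1], [1, 1, 0, 0])

# Toy data (assumed): the clustering cuts across the reference partition.
crossed = rand_statistic([0, 0, 1, 1], [0, 1, 0, 1])
```

The statistic depends only on which points are grouped together, not on the label values, so a relabelled but otherwise identical clustering scores 1.0. The value is not corrected for chance agreement.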