An indication of unification for different clustering approaches

  • Authors:
  • Jyrko Correa-Morris

  • Affiliations:
  • Applied Mathematics Department, Faculty of Mathematics, Havana University, Cuba and Institute for Pure and Applied Mathematics (IMPA), Rio de Janeiro, Brazil

  • Venue:
  • Pattern Recognition
  • Year:
  • 2013

Quantified Score

Hi-index 0.01

Visualization

Abstract

The question of finding generic concepts and properties common to the different clustering approaches is a current problem. This inquire is addressed most thoroughly in Kleinberg's paper on the Impossibility Theorem (see [1]). Kleinberg introduced the notion of clustering function - a function that takes a dissimilarity measure defined on a data set S and returns a partition of S; and a set of simple properties for the study of such functions - Scale Invariance, Richness and Consistency. The main result of [1] is the Impossibility Theorem: there is no clustering method satisfying all these properties. This study has been accepted as a rigorous proof of the difficulty in finding a unified framework for different clustering approaches. Our goal in this paper is to provide primary concepts and results for the formal study of the various clustering approaches. To accomplish this, we discuss and expand on the ideas introduced by Kleinberg. Our guiding philosophy is to incorporate a crucial fact overlooked in the study conducted in [1] - clustering methods not only depend on the dissimilarity measure but also on other parameters such as dissimilarity thresholds, centroids, stop criteria, among others. This paper gives a formal definition of clustering method and reformulates the afore-mentioned properties, even it introduces some new. Contrary to the result obtained in [1], many of the methods discussed here satisfy all of our properties. With all these grounds in hand we glimpse a clue of unification among the different clustering approaches.