Techniques for measuring the stability of clustering: a comparative study

  • Authors:
  • Vijay V. Raghavan; M. Y. L. Ip

  • Affiliations:
  • University of Regina, Regina, Sask., Canada; Datatron Corp., Lethbridge, Alta., Canada

  • Venue:
  • SIGIR '82 Proceedings of the 5th annual ACM conference on Research and development in information retrieval
  • Year:
  • 1982

Abstract

One of the significant factors in assessing the suitability of a clustering technique for a given application is its stability, that is, how sensitive the algorithm is to perturbations in the input data. A number of techniques that appear suitable for measuring the stability of clustering have been published in the literature. This paper presents the details of each of these measures, including a description of the steps involved in its computation and an identification of precisely what it measures. The measures are considered in the context of analyzing the stability characteristics of clustering techniques and are compared using a framework developed for this purpose. The question of generalizing some of these measures is addressed, and the measures are also analyzed to identify conditions under which they can be reduced to one another.
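
To make the notion of stability concrete, the following is a minimal sketch of one generic way to quantify sensitivity to input perturbations. It is not one of the measures compared in the paper: the clustering rule (`threshold_clusters`), the agreement score (the Rand index), the uniform-noise perturbation model, and all function and parameter names are assumptions chosen purely for illustration.

```python
import random
from itertools import combinations


def threshold_clusters(points, threshold):
    """Hypothetical clustering rule: link any two points whose Euclidean
    distance is at most `threshold` and return one cluster label per point
    (the connected components of the resulting graph)."""
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(n), 2):
        dist = sum((a - b) ** 2 for a, b in zip(points[i], points[j])) ** 0.5
        if dist <= threshold:
            parent[find(i)] = find(j)
    return [find(i) for i in range(n)]


def rand_index(labels_a, labels_b):
    """Fraction of point pairs on which two partitions agree
    (both place the pair together, or both place it apart)."""
    pairs = list(combinations(range(len(labels_a)), 2))
    agree = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def stability(points, threshold, noise, trials=20, seed=0):
    """Average Rand index between the clustering of the original data and
    clusterings of copies perturbed by uniform noise in [-noise, noise].
    The noise model is an assumption, not taken from the paper."""
    rng = random.Random(seed)
    base = threshold_clusters(points, threshold)
    scores = []
    for _ in range(trials):
        perturbed = [
            tuple(x + rng.uniform(-noise, noise) for x in p) for p in points
        ]
        scores.append(rand_index(base, threshold_clusters(perturbed, threshold)))
    return sum(scores) / len(scores)


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
    print(stability(data, threshold=1.0, noise=0.01))
```

Under this sketch, a score near 1 indicates that small perturbations leave the partition essentially unchanged, while lower scores indicate that the clustering is sensitive to the perturbation. Other agreement scores or perturbation models could be substituted without changing the overall structure.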