Resampling Method for Unsupervised Estimation of Cluster Validity

  • Authors:
  • Erel Levine;Eytan Domany

  • Affiliations:
  • Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot 76100, Israel;Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot 76100, Israel

  • Venue:
  • Neural Computation
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a method for validation of results obtained by clustering analysis of data. The method is based on resampling the available data. A figure of merit that measures the stability of clustering solutions against resampling is introduced. Clusters that are stable against resampling give rise to local maxima of this figure of merit. This is presented first for a one-dimensional data set, for which an analytic approximation for the figure of merit is derived and compared with numerical measurements. Next, the applicability of the method is demonstrated for higher-dimensional data, including gene microarray expression data.