A new clusterwise similarity for partitions based on quantitative disagreement

  • Authors:
  • P. Rajasekhara;Arun K. Pujari

  • Affiliations:
  • CA Technologies Private Ltd., Nanakramguda, Gachibowli, Hyderabad;Sambalpur University, Sambalpur

  • Venue:
  • Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Combining the results of different clustering to get a consensus clustering has attracted the attention of data mining researchers. In this context, it becomes necessary to measure diversity (or similarity) of a pair of partitions. Several diversity indices exist and these are based either on pairwise agreement or on clusterwise agreement. In pairwise agreement approach, similarity of two clusters is the number of common pairs of data elements. However, it is equally important to measure the level of disagreement rather than counting the frequency of disagreed pairs. We formulate this problem as a Transportation Problem and use Northwest Corner rule to compute feasible significance measures. We use this idea to propose a new index which differs from the existing measures in evaluating the extent of agreement by measuring the disagreement of data-pairs in terms of distance between cluster-pair of the disagreed data. We show experimentally that this yields a far better diversity index.