An efficient algorithm for computing the distance between close partitions

  • Authors:
  • Daniel Cosmin Porumbel;Jin Kao Hao;Pascale Kuntz

  • Affiliations:
  • Univ. Lille-Nord de France, UArtois, LGI2A, Technoparc Futura 62400, Béthune, France;LERIA, Université d'Angers, 2 Bd Lavoisier, 49045 Angers, France;LINA, Polytech'Nantes, rue Christian Pauc, 44306 Nantes, France

  • Venue:
  • Discrete Applied Mathematics
  • Year:
  • 2011

Quantified Score

Hi-index 0.04

Visualization

Abstract

A K-partition of a set S is a splitting of S into K non-overlapping classes that cover all elements of S. Numerous practical applications dealing with data partitioning or clustering require computing the distance between two partitions. Previous articles proved that one can compute it in polynomial time-minimum O(|S|+K^2) and maximum O(|S|+K^3)-using a reduction to the linear assignment problem. We propose several conditions for which the partition distance can be computed in O(|S|) time. In practical terms, this computation can be done in O(|S|) time for any two relatively resembling partitions (i.e. with distance less than |S|5) except specially constructed cases. Finally, we prove that, even if there is a bounded number of classes for which the proposed conditions are not satisfied, one can still preserve the linear complexity by exploiting decomposition properties of the similarity matrix.