Data Resampling for Path Based Clustering

  • Authors:
  • Bernd Fischer;Joachim M. Buhmann

  • Affiliations:
  • -;-

  • Venue:
  • Proceedings of the 24th DAGM Symposium on Pattern Recognition
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Path Based Clustering assigns two objects to the same cluster if they are connected by a path with high similarity between adjacent objects on the path. In this paper, we propose a fast agglomerative algorithm to minimize the Path Based Clustering cost function. To enhance the reliability of the clustering results a stochastic resampling method is used to generate candidate solutions which are merged to yield empirical assignment probabilities of objects to clusters. The resampling algorithm measures the reliability of the clustering solution and, based on their stability, determines the number of clusters.