Clustering Data without Prior Knowledge

  • Authors:
  • Javed A. Aslam;Alain Leblanc;Clifford Stein

  • Affiliations:
  • -;-;-

  • Venue:
  • WAE '00 Proceedings of the 4th International Workshop on Algorithm Engineering
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a new approach for clustering a data set for which the only information available is a similarity measure between every pair of elements. The objective is to partition the set into disjoint subsets such that two elements assigned to the same subset are more likely to have a high similarity measure than elements assigned to different subsets. The algorithm makes no assumption about the size or number of clusters, or of any constraint in the similarity measure. The algorithm relies on very simple operations. The running time is dominated by matrix multiplication, and in some cases curve-fitting. We will present experimental results from various implementations of this method.