Approximation techniques for clustering dissimilarity data

  • Authors:
  • Xibin Zhu;Andrej Gisbrecht;Frank-Michael Schleif;Barbara Hammer

  • Affiliations:
  • Bielefeld University, CITEC Centre of Excellence, D-33594 Bielefeld, Germany;Bielefeld University, CITEC Centre of Excellence, D-33594 Bielefeld, Germany;Bielefeld University, CITEC Centre of Excellence, D-33594 Bielefeld, Germany;Bielefeld University, CITEC Centre of Excellence, D-33594 Bielefeld, Germany

  • Venue:
  • Neurocomputing
  • Year:
  • 2012

Quantified Score

Hi-index 0.01

Visualization

Abstract

Recently, diverse high quality prototype-based clustering techniques have been developed which can directly deal with data sets given by general pairwise dissimilarities rather than standard Euclidean vectors. Examples include affinity propagation, relational neural gas, or relational generative topographic mapping. Corresponding to the size of the dissimilarity matrix, these techniques scale quadratically with the size of the training set, such that training becomes prohibitive for large data volumes. In this contribution, we investigate two different linear time approximation techniques, patch processing and the Nystrom approximation. We apply these approximations to several representative clustering techniques for dissimilarities, where possible, and compare the results for diverse data sets.