Finding natural clusters using multi-clusterer combiner based on shared nearest neighbors

  • Authors:
  • Hanan Ayad;Mohamed Kamel

  • Affiliations:
  • Pattern Analysis and Machine Intelligence Lab, Systems Design Engineering, University of Waterloo, Waterloo, Ontario, Canada;Pattern Analysis and Machine Intelligence Lab, Systems Design Engineering, University of Waterloo, Waterloo, Ontario, Canada

  • Venue:
  • MCS'03 Proceedings of the 4th international conference on Multiple classifier systems
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a multiple data clusterings combiner, based on a proposed Weighted Shared nearest neighbors Graph (WSnnG). While combining of multiple classifiers (supervised learners) is now an active and mature area, only a limited number of contemporary research in combining multiple data clusterings (unsupervised learners) appear in the literature. The problem addressed in this paper is that of generating a reliable clustering to represent the natural cluster structure in a set of patterns, when a number of different clusterings of the data is available or can be generated. The underlying model of the proposed shared nearest neighbors based combiner is a weighted graph, whose vertices correspond to the set of patterns, and are assigned relative weights based on a ratio of a balancing factor to the size of their shared nearest neighbors population. The edges in the graph exist only between patterns that share a pre-specified portion of their nearest neighborhood. The graph can be further partitioned into a desired number of clusters. Preliminary experiments show promising results, and comparison with a recent study justifies the combiner's suitability to the pre-defined problem domain.