Cluster Cores-Based Clustering for High Dimensional Data

  • Authors:
  • Yi-Dong Shen;Zhi-Yong Shen;Shi-Ming Zhang;Qiang Yang

  • Affiliations:
  • Chinese Academy of Sciences, China;Chinese Academy of Sciences, China;Chinese Academy of Sciences, China;Hong Kong Univ. of Sci. & Technology, Hong Kong

  • Venue:
  • ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a new approach to clustering high dimensional data based on a novel notion of cluster cores, instead of on nearest neighbors. A cluster core is a fairly dense group with a maximal number of pairwise similar objects. It represents the core of a cluster, as all objects in a cluster are with a great degree attracted to it. As a result, building clusters from cluster cores achieves high accuracy. Other major characteristics of the approach include: (1) It uses a semantics-based similarity measure. (2) It does not incur the curse of dimensionality and is scalable linearly with the dimensionality of data. (3) It outperforms the well-known clustering algorithm, ROCK, with both lower time complexity and higher accuracy.