CoFD: An Algorithm for Non-distance Based Clustering in High Dimensional Spaces

  • Authors:
  • Shenghuo Zhu;Tao Li;Mitsunori Ogihara

  • Affiliations:
  • -;-;-

  • Venue:
  • DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

The clustering problem, which aims at identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity clusters, has been widely studied. Traditional clustering algorithms use distance functions to measure similarity and are not suitable for high dimensional spaces. In this paper, we propose CoFD algorithm, which is a non-distance based clustering algorithm for high dimensional spaces. Based on the maximum likelihood principle, CoFD is to optimize parameters to maximize the likelihood between data points and the model generated by the parameters. Experimental results on both synthetic data sets and a real data set show the efficiency and effectiveness of CoFD.