Clustering categorical data using coverage density

  • Authors:
  • Hua Yan;Lei Zhang;Yi Zhang

  • Affiliations:
  • Computational Intelligence Laboratory, School of Computer Science and Engineering, University of Electronic Science & Technology of China, Chengdu, P.R. China;Computational Intelligence Laboratory, School of Computer Science and Engineering, University of Electronic Science & Technology of China, Chengdu, P.R. China;Computational Intelligence Laboratory, School of Computer Science and Engineering, University of Electronic Science & Technology of China, Chengdu, P.R. China

  • Venue:
  • ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a new algorithm based on the idea of coverage density is proposed for clustering categorical data. It uses average coverage density as the global criterion function. Large sparse categorical databases can be clustered effectively by using this algorithm. It shows that the algorithm uses less memory and time by analyzing its time and space complexity. Experiments on two real datasets are carried out to illustrate the performance of the proposed algorithm.