A New Supervised Clustering Algorithm for Data Set with Mixed Attributes

  • Authors:
  • Shijin Li;Jing Liu;Yuelong Zhu;Xiaohua Zhang

  • Affiliations:
  • Hohai University, China;Hohai University, China;Hohai University, China;Hohai University, China

  • Venue:
  • SNPD '07 Proceedings of the Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing - Volume 02
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Because of the complexity of data set with mixed attributes, the traditional clustering algorithms appropriate for this kind of dataset are few and the effect of clustering is not good. K-prototype clustering is one of the most commonly used methods in data mining for this kind of data. We borrow the ideas from the multiple classifiers combing technology, use k-prototype as the basis clustering algorithm to design a multi-level clustering ensemble algorithm in this paper, which adaptively selects attributes for re-clustering. Comparison experiments on Adult data set from UCI machine learning data repository show very competitive results and the proposed method is suitable for data editing.