A hybrid incremental clustering method-combining support vector machine and enhanced clustering by committee clustering algorithm

  • Authors:
  • Deng-Yiv Chiu;Kong-Ling Hsieh

  • Affiliations:
  • Department of Information Management, ChungHua University, Hsin-Chu, Taiwan, R.O.C.;Department of Information Management, ChungHua University, Hsin-Chu, Taiwan, R.O.C.

  • Venue:
  • PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the study, a new hybrid incremental clustering method is proposed in combination with Support Vector Machine (SVM) and enhanced Clustering by Committee (CBC) algorithm. SVM classifies the incoming document to see if it belongs to the existing classes. Then the enhanced CBC algorithm is used to cluster the unclassified documents. SVM can significantly reduce the amount of calculation and the noise of clustering. The enhanced CBC algorithm can effectively control the number of clusters, improve performance and allow the number of classes to grow gradually based on the structure of current classes without clustering all of documents again. In empirical results, the proposed method outperforms the enhanced CBC clustering method and other algorithms. Also, the enhanced CBC clustering method outperforms original CBC.