PCA-Guided k-Means with Variable Weighting and Its Application to Document Clustering

  • Authors:
  • Katsuhiro Honda;Akira Notsu;Hidetomo Ichihashi

  • Affiliations:
  • Osaka Prefecture University, Osaka, Japan 599-8531;Osaka Prefecture University, Osaka, Japan 599-8531;Osaka Prefecture University, Osaka, Japan 599-8531

  • Venue:
  • MDAI '09 Proceedings of the 6th International Conference on Modeling Decisions for Artificial Intelligence
  • Year:
  • 2009

Abstract

PCA-guided k-Means is a deterministic approach to k-Means clustering in which cluster indicators are derived in a PCA-guided manner. This paper proposes a new approach to k-Means with variable selection by introducing a variable weighting mechanism into PCA-guided k-Means. The relative responsibility of each variable is estimated in a manner similar to FCM clustering, while the membership indicator is derived in a PCA-guided manner, with the principal component scores calculated using the responsibility weights of the variables. As a result, variables that carry meaningful information about the cluster structure are emphasized when the membership indicators are calculated. Numerical experiments, including an application to document clustering, demonstrate the characteristics of the proposed method.
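
The abstract does not state the update equations, so the following is only a minimal sketch of the idea under explicit assumptions, not the authors' algorithm. It assumes two clusters, standardized variables, a membership indicator taken from the sign of the first principal component score of the weighted data, and an FCM-style weight update in which each variable's weight is inversely related to its within-cluster dispersion. The function name `weighted_pca_guided_two_means` and the parameters `theta` and `n_iter` are hypothetical.

```python
import numpy as np

def weighted_pca_guided_two_means(X, theta=2.0, n_iter=20):
    """Illustrative two-cluster sketch (assumed form, not the paper's equations).

    Alternates between:
      1. a PCA-guided indicator: sign of the first principal component
         score of the variable-weighted data;
      2. an FCM-style variable weight update based on each variable's
         within-cluster dispersion. `theta` (> 1) is an assumed fuzzifier.
    """
    X = np.asarray(X, dtype=float)
    # Standardize so that dispersions are comparable across variables.
    std = X.std(axis=0)
    std[std == 0] = 1.0
    Z = (X - X.mean(axis=0)) / std
    n, m = Z.shape

    w = np.full(m, 1.0 / m)          # start from equal variable weights
    labels = None
    for _ in range(n_iter):
        # Weighted data: squared distances become sum_j w_j * (difference_j)^2.
        Zw = Z * np.sqrt(w)
        # First principal component scores via SVD of the centred data.
        U, S, _ = np.linalg.svd(Zw, full_matrices=False)
        score = U[:, 0] * S[0]
        new_labels = (score > 0).astype(int)

        # Within-cluster dispersion of each variable (unweighted).
        d = np.zeros(m)
        for c in (0, 1):
            members = Z[new_labels == c]
            if len(members):
                d += ((members - members.mean(axis=0)) ** 2).sum(axis=0)
        d = np.maximum(d, 1e-12)     # guard against division by zero

        # FCM-style update: w_j proportional to d_j^(-1/(theta-1)), summing to 1.
        inv = d ** (-1.0 / (theta - 1.0))
        w = inv / inv.sum()

        if labels is not None and np.array_equal(new_labels, labels):
            break                    # partition is stable
        labels = new_labels
    return new_labels, w
```

Calling the function on a data matrix with rows as objects and columns as variables returns a 0/1 label per row and one weight per column. Because the sign of a principal component is arbitrary, the two labels may be swapped between runs on different data. Variables whose values differ little within each cluster receive larger weights in this sketch, which mirrors the emphasis on informative variables described in the abstract.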