Categorical data visualization and clustering using subjective factors

  • Authors:
  • Chia-Hui Chang;Zhi-Kai Ding

  • Affiliations:
  • Department of Computer Science and Information Engineering, National Central University, No. 300, Jhungda Road, Jhungli City, Taoyuan 320, Taiwan;Department of Computer Science and Information Engineering, National Central University, No. 300, Jhungda Road, Jhungli City, Taoyuan 320, Taiwan

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering is an important data mining problem. However, most earlier work on clustering focused on numeric attributes which have a natural ordering to their attribute values. Recently, clustering data with categorical attributes, whose attribute values do not have a natural ordering, has received more attention. A common issue in cluster analysis is that there is no single correct answer to the number of clusters, since cluster analysis involves human subjective judgement. Interactive visualization is one of the methods where users can decide a proper clustering parameters. In this paper, a new clustering approach called CDCS (Categorical Data Clustering with Subjective factors) is introduced, where a visualization tool for clustered categorical data is developed such that the result of adjusting parameters is instantly reflected. The experiment shows that CDCS generates high quality clusters compared to other typical algorithms.