Clustering Item Data Sets with Association-Taxonomy Similarity

  • Authors:
  • Ching-Huang Yun;Kun-Ta Chuang;Ming-Syan Chen

  • Affiliations:
  • -;-;-

  • Venue:
  • ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We explore in this paper the efficient clustering of itemdata. Different from those of the traditional data, the featuresof item data are known to be of high dimensionalityand sparsity. In view of the features of item data, we devisein this paper a novel measurement, called the association-taxonomysimilarity, and utilize this measurement to performthe clustering. With this association-taxonomy similaritymeasurement, we develop an efficient clustering algorithm,called algorithm AT (standing for Association-Taxonomy),for item data. Two validation indexes basedon association and taxonomy properties are also devised toassess the quality of clustering for item data. As validatedby the real dataset, it is shown by our experimental resultsthat algorithm AT devised in this paper significantly outperformsthe prior works in the clustering quality as measuredby the validation indexes, indicating the usefulness ofassociation-taxonomy similarity in item data clustering.