A Fast Subspace Clustering Algorithm Based on Pattern Similarity

  • Authors:
  • Yanglan Gan;Jihong Guan;Hao Wang

  • Affiliations:
  • Tongji University;Tongji University;Hefei University of Technology

  • Venue:
  • FSKD '07 Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 03
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Traditional clustering models define similarity by distance over dimensions. However, distance functions are not always adequate in capturing correlations among the objects. Pattern-based clustering can discover this kind of clusters. But state-of-the-art pattern-based clustering methods are inefficient and haven't criteria to evaluate the quality of clusters. This paper presents a novel pattern similarity-based subspace clustering with the pattern tree (PPSC for short) that offers these capabilities. The method uses new evaluation criteria to discover best clusters, which enables user to find clusters according to different needs. Meanwhile, observing the analogy between mining frequent itemsets and discovering subspace clusters around random points, we apply the pattern-tree to determine subspace by scanning the database once, so it can perform efficiently in large datasets.