Scalable density-based subspace clustering

  • Authors:
  • Emmanuel Müller;Ira Assent;Stephan Günnemann;Thomas Seidl

  • Affiliations:
  • Karlsruhe Institute of Technology, Karlsruhe , Germany;Aarhus University, Aarhus, Denmark;RWTH Aachen University, Aachen , Germany;RWTH Aachen University, Aachen , Germany

  • Venue:
  • Proceedings of the 20th ACM international conference on Information and knowledge management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

For knowledge discovery in high dimensional databases, subspace clustering detects clusters in arbitrary subspace projections. Scalability is a crucial issue, as the number of possible projections is exponential in the number of dimensions. We propose a scalable density-based subspace clustering method that steers mining to few selected subspace clusters. Our novel steering technique reduces subspace processing by identifying and clustering promising subspaces and their combinations directly. Thereby, it narrows down the search space while maintaining accuracy. Thorough experiments on real and synthetic databases show that steering is efficient and scalable, with high quality results. For future work, our steering paradigm for density-based subspace clustering opens research potential for speeding up other subspace clustering approaches as well.