Clicks: An effective algorithm for mining subspace clusters in categorical datasets

  • Authors:
  • Mohammed J. Zaki;Markus Peters;Ira Assent;Thomas Seidl

  • Affiliations:
  • Department of Computer Science, Rensselaer Polytechnic Institute, Lally 307, 110 8th St., Troy, NY 12180-3590, United States;RWTH-Aachen, Germany;RWTH-Aachen, Germany;RWTH-Aachen, Germany

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a novel algorithm called Clicks, that finds clusters in categorical datasets based on a search for k-partite maximal cliques. Unlike previous methods, Clicks mines subspace clusters. It uses a selective vertical method to guarantee complete search. Clicks outperforms previous approaches by over an order of magnitude and scales better than any of the existing method for high-dimensional datasets. These results are demonstrated in a comprehensive performance study on real and synthetic datasets.