Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data

Authors:
Emmanuel Müller;Ira Assent;Stephan Günnemann;Ralph Krieger;Thomas Seidl
Affiliations:
-;-;-;-;-
Venue:
ICDM '09 Proceedings of the 2009 Ninth IEEE International Conference on Data Mining
Year:
2009

Citing 0
Cited 12

Adaptive outlierness for subspace outlier ranking

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Subspace clustering for indexing high dimensional data: a main memory index based on local reductions and individual multi-representations

Proceedings of the 14th International Conference on Extending Database Technology
INCONCO: interpretable clustering of numerical and categorical objects

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
An extension of the PMML standard to subspace clustering models

Proceedings of the 2011 workshop on Predictive markup language modeling
Efficient selectivity estimation by histogram construction based on subspace clustering

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Scalable density-based subspace clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
External evaluation measures for subspace clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
Subspace clustering

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Mining coherent subgraphs in multi-layer graphs with edge labels

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Substructure clustering: a novel mining paradigm for arbitrary data types

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
A survey on enhanced subspace clustering

Data Mining and Knowledge Discovery
Projective clustering ensembles

Data Mining and Knowledge Discovery

Quantified Score

Hi-index	0.00

Visualization

Abstract

Subspace clustering aims at detecting clusters in any subspace projection of a high dimensional space. As the number of possible subspace projections is exponential in the number of dimensions, the result is often tremendously large. Recent approaches fail to reduce results to relevant subspace clusters. Their results are typically highly redundant, i.e. many clusters are detected multiple times in several projections. In this work, we propose a novel model for relevant subspace clustering (RESCU). We present a global optimization which detects the most interesting non-redundant subspace clusters. We prove that computation of this model is NP-hard. For RESCU, we propose an approximative solution that shows high accuracy with respect to our relevance model. Thorough experiments on synthetic and real world data show that RESCU successfully reduces the result to manageable sizes. It reliably achieves top clustering quality while competing approaches show greatly varying performance.