Beyond uniformity and independence: analysis of R-trees using the concept of fractal dimension
PODS '94 Proceedings of the thirteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
The SR-tree: an index structure for high-dimensional nearest neighbor queries
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Automatic subspace clustering of high dimensional data for data mining applications
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
The pyramid-technique: towards breaking the curse of dimensionality
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
OPTICS: ordering points to identify the clustering structure
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Fast algorithms for projected clustering
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Entropy-based subspace clustering for mining numerical data
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
ACM Computing Surveys (CSUR)
Finding generalized projected clusters in high dimensional spaces
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Using the fractal dimension to cluster datasets
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering through decision tree construction
Proceedings of the ninth international conference on Information and knowledge management
Use of the Hough transformation to detect lines and curves in pictures
Communications of the ACM
Principles of data mining
Co-clustering documents and words using bipartite spectral graph partitioning
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Discovering associations with numeric variables
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Formal Concept Analysis: Mathematical Foundations
Formal Concept Analysis: Mathematical Foundations
Clustering by pattern similarity in large data sets
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
A Monte Carlo algorithm for fast projective clustering
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Discovering local structure in gene expression data: the order-preserving submatrix problem
Proceedings of the sixth annual international conference on Computational biology
Computers and Intractability: A Guide to the Theory of NP-Completeness
Computers and Intractability: A Guide to the Theory of NP-Completeness
The TV-tree: an index structure for high-dimensional data
The VLDB Journal — The International Journal on Very Large Data Bases - Spatial Database Systems
On the 'Dimensionality Curse' and the 'Self-Similarity Blessing'
IEEE Transactions on Knowledge and Data Engineering
Fast Nearest Neighbor Search in High-Dimensional Space
ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Biclustering of Expression Data
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Analysis of Gene Expression Microarrays for Phenotype Classification
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
What Is the Nearest Neighbor in High Dimensional Spaces?
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Local Dimensionality Reduction: A New Approach to Indexing High Dimensional Spaces
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Estimating the Selectivity of Spatial Queries Using the `Correlation' Fractal Dimension
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
The X-tree: An Index Structure for High-Dimensional Data
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Independent Quantization: An Index Compression Technique for High-Dimensional Data Spaces
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Deflating the Dimensionality Curse Using Multiple Fractal Dimensions
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
d-Clusters: Capturing Subspace Correlation in a Large Data Set
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Frequent-Pattern based Iterative Projected Clustering
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
OP-Cluster: Clustering by Tendency in High Dimensional Space
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
MaPle: A Fast Algorithm for Maximal Pattern-based Clustering
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Computing Clusters of Correlation Connected objects
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Subspace clustering for high dimensional data: a review
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Machine Learning
Biclustering Algorithms for Biological Data Analysis: A Survey
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Cluster Analysis for Gene Expression Data: A Survey
IEEE Transactions on Knowledge and Data Engineering
HARP: A Practical Projected Clustering Algorithm
IEEE Transactions on Knowledge and Data Engineering
Density Connected Clustering with Local Subspace Preferences
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Quantitative Association Rules Based on Half-Spaces: An Optimization Approach
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Iterative Projected Clustering by Subspace Mining
IEEE Transactions on Knowledge and Data Engineering
Automated Variable Weighting in k-Means Type Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence
On Discovery of Extremely Low-Dimensional Clusters Using Semi-Supervised Projected Clustering
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
CURLER: finding and visualizing nonlinear correlation clusters
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Introduction to Data Mining, (First Edition)
Introduction to Data Mining, (First Edition)
A Generic Framework for Efficient Subspace Clustering of High-Dimensional Data
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Deriving quantitative models for correlation clusters
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining Hierarchies of Correlation Clusters
SSDBM '06 Proceedings of the 18th International Conference on Scientific and Statistical Database Management
Pattern Recognition and Machine Learning (Information Science and Statistics)
Pattern Recognition and Machine Learning (Information Science and Statistics)
Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
P3C: A Robust Projected Clustering Algorithm
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
What Constitutes a Scientific Database?
SSDBM '07 Proceedings of the 19th International Conference on Scientific and Statistical Database Management
On Exploring Complex Relationships of Correlation Clusters
SSDBM '07 Proceedings of the 19th International Conference on Scientific and Statistical Database Management
An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data
IEEE Transactions on Knowledge and Data Engineering
On data mining, compression, and Kolmogorov complexity
Data Mining and Knowledge Discovery
VISA: visual subspace clustering analysis
ACM SIGKDD Explorations Newsletter - Special issue on visual analytics
A Perspective on Cluster Analysis
Statistical Analysis and Data Mining
SCHISM: a new approach to interesting subspace mining
International Journal of Business Intelligence and Data Mining
Knowledge and Information Systems
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Model-based linear manifold clustering
Model-based linear manifold clustering
DUSC: Dimensionality Unbiased Subspace Clustering
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Constrained locally weighted clustering
Proceedings of the VLDB Endowment
A fast algorithm for finding correlation clusters in noise data
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Detection and visualization of subspace cluster hierarchies
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Compressing tags to find interesting media groups
Proceedings of the 18th ACM conference on Information and knowledge management
Detection of orthogonal concepts in subspaces of high dimensional data
Proceedings of the 18th ACM conference on Information and knowledge management
ACM SIGKDD Explorations Newsletter
Subspace and projected clustering: experimental evaluation and analysis
Knowledge and Information Systems
Learning in parallel universes
Data Mining and Knowledge Discovery
Towards subspace clustering on dynamic data: an incremental version of PreDeCon
Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Metric spaces in data mining: applications to clustering
SIGSPATIAL Special
Automatic parameter determination in subspace clustering with gravitation function
Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Exploiting tag and word correlations for improved webpage clustering
SMUC '10 Proceedings of the 2nd international workshop on Search and mining user-generated contents
A case study on financial ratios via cross-graph quasi-bicliques
Information Sciences: an International Journal
Can shared-neighbor distances defeat the curse of dimensionality?
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Subspace similarity search: efficient k-NN queries in arbitrary subspaces
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Mining relaxed closed subspace clusters
Proceedings of the 48th Annual Southeast Regional Conference
CoDA: interactive cluster based concept discovery
Proceedings of the VLDB Endowment
Document clustering using synthetic cluster prototypes
Data & Knowledge Engineering
Sampling for information and structure preservation when mining large data bases
IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
Proceedings of the 14th International Conference on Extending Database Technology
Making interval-based clustering rank-aware
Proceedings of the 14th International Conference on Extending Database Technology
CIM: categorical influence maximization
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
ClustCube: an OLAP-based framework for clustering and mining complex database objects
Proceedings of the 2011 ACM Symposium on Applied Computing
Locality sensitive hashing for sampling-based algorithms in association rule mining
Expert Systems with Applications: An International Journal
Anomaly detection techniques for a web defacement monitoring service
Expert Systems with Applications: An International Journal
BSN: An automatic generation algorithm of social network data
Journal of Systems and Software
Annotated stochastic context free grammars for analysis and synthesis of proteins
EvoBIO'11 Proceedings of the 9th European conference on Evolutionary computation, machine learning and data mining in bioinformatics
BARTMAP: A viable structure for biclustering
Neural Networks
Clustering very large multi-dimensional datasets with MapReduce
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
INCONCO: interpretable clustering of numerical and categorical objects
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Tracing evolving clusters by subspace and value similarity
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
An extension of the PMML standard to subspace clustering models
Proceedings of the 2011 workshop on Predictive markup language modeling
Density based subspace clustering over dynamic data
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
DB-CSC: a density-based approach for subspace clustering in graphs with feature vectors
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Quality of similarity rankings in time series
SSTD'11 Proceedings of the 12th international conference on Advances in spatial and temporal databases
OLAP over continuous domains via density-based hierarchical clustering
KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part II
Scalable density-based subspace clustering
Proceedings of the 20th ACM international conference on Information and knowledge management
External evaluation measures for subspace clustering
Proceedings of the 20th ACM international conference on Information and knowledge management
Fast and flexible unsupervised custering algorithm based on ultrametric properties
Proceedings of the 7th ACM symposium on QoS and security for wireless and mobile networks
Model-based multidimensional clustering of categorical data
Artificial Intelligence
Fast fractal stack: fractal analysis of computed tomography scans of the lung
MMAR '11 Proceedings of the 2011 international ACM workshop on Medical multimedia analysis and retrieval
Information Sciences: an International Journal
Tracing Evolving Subspace Clusters in Temporal Climate Data
Data Mining and Knowledge Discovery
Determining the number of clusters using information entropy for mixed data
Pattern Recognition
Feature interaction in subspace clustering using the Choquet integral
Pattern Recognition
A robust seedless algorithm for correlation clustering
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Partitive clustering (K-means family)
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Clustering high dimensional data
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Visualization of Global Correlation Structures in Uncertain 2D Scalar Fields
Computer Graphics Forum
Social event detection using multimodal clustering and integrating supervisory signals
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
An evolutionary subspace clustering algorithm for high-dimensional data
Proceedings of the 14th annual conference companion on Genetic and evolutionary computation
Leveraging Social Bookmarks from Partially Tagged Corpus for Improved Web Page Clustering
ACM Transactions on Intelligent Systems and Technology (TIST)
Multi-view clustering using mixture models in subspace projections
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining coherent subgraphs in multi-layer graphs with edge labels
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining of temporal coherent subspace clusters in multivariate time series databases
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Automatic aspect discrimination in data clustering
Pattern Recognition
Substructure clustering: a novel mining paradigm for arbitrary data types
SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
A survey on unsupervised outlier detection in high-dimensional numerical data
Statistical Analysis and Data Mining
Model-based clustering of high-dimensional data: Variable selection versus facet determination
International Journal of Approximate Reasoning
Parsimonious Mahalanobis kernel for the classification of high dimensional data
Pattern Recognition
Density-Based projected clustering of data streams
SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management
Semantic-preservingword clouds by seam carving
EuroVis'11 Proceedings of the 13th Eurographics / IEEE - VGTC conference on Visualization
Visualizing high-dimensional structures by dimension ordering and filtering using subspace analysis
EuroVis'11 Proceedings of the 13th Eurographics / IEEE - VGTC conference on Visualization
A New Locally Weighted K-Means for Cancer-Aided Microarray Data Analysis
Journal of Medical Systems
Enhancing density-based clustering: Parameter reduction and outlier detection
Information Systems
A survey on enhanced subspace clustering
Data Mining and Knowledge Discovery
A generalized cluster centroid based classifier for text categorization
Information Processing and Management: an International Journal
Projective clustering ensembles
Data Mining and Knowledge Discovery
Chaotic neural network for biometric pattern recognition
Advances in Artificial Intelligence - Special issue on Learning Approaches for Biometric Identification and Verification
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Interactive data mining with 3D-parallel-coordinate-trees
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Finding the most descriptive substructures in graphs with discrete and numeric labels
NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
RMiCS: a robust approach for mining coherent subgraphs in edge-labeled multi-layer graphs
Proceedings of the 25th International Conference on Scientific and Statistical Database Management
CopyCatch: stopping group attacks by spotting lockstep behavior in social networks
Proceedings of the 22nd international conference on World Wide Web
Finding contexts of social influence in online social networks
Proceedings of the 7th Workshop on Social Network Mining and Analysis
Configurations and couplings: an exploratory study
ICDM'13 Proceedings of the 13th international conference on Advances in Data Mining: applications and theoretical aspects
GPUMAFIA: efficient subspace clustering with MAFIA on GPUs
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Mining order-preserving submatrices from probabilistic matrices
ACM Transactions on Database Systems (TODS)
Fast cartography for data explorers
Proceedings of the VLDB Endowment
Hierarchical co-clustering: off-line and incremental approaches
Data Mining and Knowledge Discovery
Shape classification by manifold learning in multiple observation spaces
Information Sciences: an International Journal
Subspace clustering of high-dimensional data: a predictive approach
Data Mining and Knowledge Discovery
Semi-supervised projected model-based clustering
Data Mining and Knowledge Discovery
Hi-index | 0.01 |
As a prolific research area in data mining, subspace clustering and related problems induced a vast quantity of proposed solutions. However, many publications compare a new proposition—if at all—with one or two competitors, or even with a so-called “naïve” ad hoc solution, but fail to clarify the exact problem definition. As a consequence, even if two solutions are thoroughly compared experimentally, it will often remain unclear whether both solutions tackle the same problem or, if they do, whether they agree in certain tacit assumptions and how such assumptions may influence the outcome of an algorithm. In this survey, we try to clarify: (i) the different problem definitions related to subspace clustering in general; (ii) the specific difficulties encountered in this field of research; (iii) the varying assumptions, heuristics, and intuitions forming the basis of different approaches; and (iv) how several prominent solutions tackle different problems.