Algorithms for clustering data
Algorithms for clustering data
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Two algorithms for nearest-neighbor search in high dimensions
STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
CURE: an efficient clustering algorithm for large databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Automatic subspace clustering of high dimensional data for data mining applications
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Dimensionality reduction for similarity searching in dynamic databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Approximate nearest neighbors: towards removing the curse of dimensionality
STOC '98 Proceedings of the thirtieth annual ACM symposium on Theory of computing
Fast algorithms for projected clustering
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Entropy-based subspace clustering for mining numerical data
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
A Distribution-Based Clustering Algorithm for Mining in Large Spatial Databases
ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Similarity Search in High Dimensions via Hashing
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Efficient and Effective Clustering Methods for Spatial Data Mining
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
On the merits of building categorization systems by supervised clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering through decision tree construction
Proceedings of the ninth international conference on Information and knowledge management
Outlier detection for high dimensional data
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Systems support for scalable data mining
ACM SIGKDD Explorations Newsletter - Special issue on “Scalable data mining algorithms”
On the effects of dimensionality reduction on high dimensional similarity search
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A human-computer cooperative system for effective high dimensional clustering
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Towards effective and interpretable data mining by visual interaction
ACM SIGKDD Explorations Newsletter
The convex polyhedra technique: an index structure for high-dimensional space
ADC '02 Proceedings of the 13th Australasian database conference - Volume 5
Clustering by pattern similarity in large data sets
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
A Monte Carlo algorithm for fast projective clustering
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
FREM: fast and robust EM clustering for large data sets
Proceedings of the eleventh international conference on Information and knowledge management
Finding Localized Associations in Market Basket Data
IEEE Transactions on Knowledge and Data Engineering
Redefining Clustering for High-Dimensional Applications
IEEE Transactions on Knowledge and Data Engineering
Using Projections to Visually Cluster High-Dimensional Data
Computing in Science and Engineering
OLAP-Based Data Mining for Business Intelligence Applications in Telecommunications and E-commerce
DNIS '00 Proceedings of the International Workshop on Databases in Networked Information Systems
What Is the Nearest Neighbor in High Dimensional Spaces?
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
C2P: Clustering based on Closest Pairs
Proceedings of the 27th International Conference on Very Large Data Bases
C2VA: Trim High Dimensional Indexes
WAIM '02 Proceedings of the Third International Conference on Advances in Web-Age Information Management
CoFD: An Algorithm for Non-distance Based Clustering in High Dimensional Spaces
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
Approximation Algorithms for k-Line Center
ESA '02 Proceedings of the 10th Annual European Symposium on Algorithms
IEEE Transactions on Knowledge and Data Engineering
Attentional Object Spotting by Integrating Multimodal Input
ICMI '02 Proceedings of the 4th IEEE International Conference on Multimodal Interfaces
Clustering binary data streams with K-means
DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Analyzing High-Dimensional Data by Subspace Validity
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Frequent-Pattern based Iterative Projected Clustering
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
OP-Cluster: Clustering by Tendency in High Dimensional Space
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
MaPle: A Fast Algorithm for Maximal Pattern-based Clustering
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
A multimodal learning interface for grounding spoken language in sensory perceptions
Proceedings of the 5th international conference on Multimodal interfaces
A Human-Computer Interactive Method for Projected Clustering
IEEE Transactions on Knowledge and Data Engineering
Information Visualization - Special issue on coordinated and multiple views in exploratory visualization
Outlier analysis for gene expression data
Journal of Computer Science and Technology - Special issue on bioinformatics
Hypergraph Models and Algorithms for Data-Pattern-Based Clustering
Data Mining and Knowledge Discovery
Computing Clusters of Correlation Connected objects
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Subspace clustering for high dimensional data: a review
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Efficient Disk-Based K-Means Clustering for Relational Databases
IEEE Transactions on Knowledge and Data Engineering
A framework for ontology-driven subspace clustering
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Programming the K-means clustering algorithm in SQL
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
IEEE Transactions on Knowledge and Data Engineering
HARP: A Practical Projected Clustering Algorithm
IEEE Transactions on Knowledge and Data Engineering
Iterative Projected Clustering by Subspace Mining
IEEE Transactions on Knowledge and Data Engineering
Identifying projected clusters from gene expression profiles
Journal of Biomedical Informatics
Subspace clustering for high dimensional categorical data
ACM SIGKDD Explorations Newsletter
Projective Clustering by Histograms
IEEE Transactions on Knowledge and Data Engineering
On Discovery of Extremely Low-Dimensional Clusters Using Semi-Supervised Projected Clustering
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
An effective and efficient algorithm for high-dimensional outlier detection
The VLDB Journal — The International Journal on Very Large Data Bases
CURLER: finding and visualizing nonlinear correlation clusters
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
A novel grammar-based genetic programming approach to clustering
Proceedings of the 2005 ACM symposium on Applied computing
Automatic Subspace Clustering of High Dimensional Data
Data Mining and Knowledge Discovery
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Feature bagging for outlier detection
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Cross-relational clustering with user's guidance
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Integrating K-Means Clustering with a Relational DBMS Using SQL
IEEE Transactions on Knowledge and Data Engineering
Comparing Subspace Clusterings
IEEE Transactions on Knowledge and Data Engineering
Adherence clustering: an efficient method for mining market-basket clusters
Information Systems
On the use of Human-Computer Interaction for Projected Nearest Neighbor Search
Data Mining and Knowledge Discovery
Deriving quantitative models for correlation clusters
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Robust information-theoretic clustering
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Putting context into schema matching
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Projective clustering using itemset discovery for multi-dimensional data analysis
MS'06 Proceedings of the 17th IASTED international conference on Modelling and simulation
Vector and matrix operations programmed with UDFs in a relational DBMS
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Clicks: An effective algorithm for mining subspace clusters in categorical datasets
Data & Knowledge Engineering
A dimensionality reduction algorithm and its application for interactive visualization
Journal of Visual Languages and Computing
Locally adaptive metrics for clustering high dimensional data
Data Mining and Knowledge Discovery
Bi-criteria linear-time approximations for generalized k-mean/median/center
SCG '07 Proceedings of the twenty-third annual symposium on Computational geometry
Linear manifold clustering in high dimensional spaces by stochastic search
Pattern Recognition
An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data
IEEE Transactions on Knowledge and Data Engineering
An automated system for web portal personalization
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
RIC: Parameter-free noise-robust clustering
ACM Transactions on Knowledge Discovery from Data (TKDD)
COMBI-operator - database support for data mining applications
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Learning correlations using the mixture-of-subsets model
ACM Transactions on Knowledge Discovery from Data (TKDD)
A clustering framework based on subjective and objective validity criteria
ACM Transactions on Knowledge Discovery from Data (TKDD)
Continuous subspace clustering in streaming time series
Information Systems
Mining approximate top-k subspace anomalies in multi-dimensional time-series data
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Random walk biclustering for microarray data
Information Sciences: an International Journal
SCHISM: a new approach to interesting subspace mining
International Journal of Business Intelligence and Data Mining
Mining multiple-level fuzzy blocks from multidimensional data
Fuzzy Sets and Systems
Outlier-robust clustering using independent components
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
A General Framework for Increasing the Robustness of PCA-Based Correlation Clustering Algorithms
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
ELKI: A Software System for Evaluation of Subspace Clustering Algorithms
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Constrained locally weighted clustering
Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment
REDUS: finding reducible subspaces in high dimensional data
Proceedings of the 17th ACM conference on Information and knowledge management
EDSC: efficient density-based subspace clustering
Proceedings of the 17th ACM conference on Information and knowledge management
ACM Transactions on Knowledge Discovery from Data (TKDD)
Efficient layered density-based clustering of categorical data
Journal of Biomedical Informatics
SLICE: A Novel Method to Find Local Linear Correlations by Constructing Hyperplanes
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Clustering by pattern similarity
Journal of Computer Science and Technology
Models for association rules based on clustering and correlation
Intelligent Data Analysis
Query result clustering for object-level search
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
A semi-supervised approach to projected clustering with applications to microarray data
International Journal of Data Mining and Bioinformatics
An Effective Dimension Reduction Approach to Chinese Document Classification Using Genetic Algorithm
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Subspace sums for extracting non-random data from massive noise
Knowledge and Information Systems
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Subspace and projected clustering: experimental evaluation and analysis
Knowledge and Information Systems
Evaluating clustering in subspace projections of high dimensional data
Proceedings of the VLDB Endowment
Adherence clustering: an efficient method for mining market-basket clusters
Information Systems
SKM-SNP: SNP markers detection method
Journal of Biomedical Informatics
A new clustering algorithm for transaction data via caucus
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
A fast algorithm for finding correlation clusters in noise data
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Detection and visualization of subspace cluster hierarchies
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Learning in parallel universes
Data Mining and Knowledge Discovery
Automatic parameter determination in subspace clustering with gravitation function
Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Can shared-neighbor distances defeat the curse of dimensionality?
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Proceedings of the 14th International Conference on Extending Database Technology
Pattern Recognition Letters
The role of hubness in clustering high-dimensional data
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
INCONCO: interpretable clustering of numerical and categorical objects
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Employing correlation clustering for the identification of piecewise affine models
Proceedings of the 2011 workshop on Knowledge discovery, modeling and simulation
Hybrid-LWM: A linear-model based hybrid clustering algorithm for supplier categorisation
International Journal of Systems, Control and Communications
Scalable density-based subspace clustering
Proceedings of the 20th ACM international conference on Information and knowledge management
External evaluation measures for subspace clustering
Proceedings of the 20th ACM international conference on Information and knowledge management
CLINCH: clustering incomplete high-dimensional data for data mining application
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
A near-linear algorithm for projective clustering integer points
Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms
An incremental updating method for clustering-based high-dimensional data indexing
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Generalized projected clustering in high-dimensional data streams
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
A fuzzy subspace algorithm for clustering high dimensional data
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Mining time-delayed coherent patterns in time series gene expression data
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
A metropolis sampling method for drawing representative samples from large databases
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Feature interaction in subspace clustering using the Choquet integral
Pattern Recognition
A robust seedless algorithm for correlation clustering
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
A simple feature extraction for high dimensional image representations
SLSFS'05 Proceedings of the 2005 international conference on Subspace, Latent Structure and Feature Selection
Partitive clustering (K-means family)
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Reduct and variance based clustering of high dimensional dataset
ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Dependency clustering across measurement scales
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Hinging hyperplane models for multiple predicted variables
SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
A survey on unsupervised outlier detection in high-dimensional numerical data
Statistical Analysis and Data Mining
On the equivalence of PLSI and projected clustering
ACM SIGMOD Record
Color Image Segmentation: From the View of Projective Clustering
International Journal of Multimedia Data Engineering & Management
Interactive data mining with 3D-parallel-coordinate-trees
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Fuzzy partition based soft subspace clustering and its applications in high dimensional data
Information Sciences: an International Journal
TSum: fast, principled table summarization
Proceedings of the Seventh International Workshop on Data Mining for Online Advertising
Finding multiple global linear correlations in sparse and noisy data sets
Knowledge-Based Systems
Hybrid entity clustering using crowds and data
The VLDB Journal — The International Journal on Very Large Data Bases
Survey of Clustering: Algorithms and Applications
International Journal of Information Retrieval Research
Semi-supervised projected model-based clustering
Data Mining and Knowledge Discovery
Subspace clustering of high-dimensional data: an evolutionary approach
Applied Computational Intelligence and Soft Computing
Hi-index | 0.00 |
High dimensional data has always been a challenge for clustering algorithms because of the inherent sparsity of the points. Recent research results indicate that in high dimensional data, even the concept of proximity or clustering may not be meaningful. We discuss very general techniques for projected clustering which are able to construct clusters in arbitrarily aligned subspaces of lower dimensionality. The subspaces are specific to the clusters themselves. This definition is substantially more general and realistic than currently available techniques which limit the method to only projections from the original set of attributes. The generalized projected clustering technique may also be viewed as a way of trying to redefine clustering for high dimensional applications by searching for hidden subspaces with clusters which are created by inter-attribute correlations. We provide a new concept of using extended cluster feature vectors in order to make the algorithm scalable for very large databases. The running time and space requirements of the algorithm are adjustable, and are likely ta tradeoff with better accuracy.