Automatic subspace clustering of high dimensional data for data mining applications

Authors:
Rakesh Agrawal;Johannes Gehrke;Dimitrios Gunopulos;Prabhakar Raghavan
Affiliations:
IBM Almaden Research Center, 650 Harry Road, San Jose, CA;IBM Almaden Research Center, 650 Harry Road, San Jose, CA;IBM Almaden Research Center, 650 Harry Road, San Jose, CA;IBM Almaden Research Center, 650 Harry Road, San Jose, CA
Venue:
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Year:
1998

Citing 26
Cited 532

CSG set-theoretic solid modelling and NC machining of blend surfaces

SCG '86 Proceedings of the second annual symposium on Computational geometry
Covering a simple orthogonal polygon with a minimum number of orthogonally convex polygons

SCG '87 Proceedings of the third annual symposium on Computational geometry
Algorithms for clustering data

Algorithms for clustering data
Performance guarantees on a sweep-line heuristic for covering rectilinear polygons with rectangles

SIAM Journal on Discrete Mathematics
Introduction to statistical pattern recognition (2nd ed.)

Introduction to statistical pattern recognition (2nd ed.)
On the hardness of approximating minimization problems

STOC '93 Proceedings of the twenty-fifth annual ACM symposium on Theory of computing
Mining quantitative association rules in large relational tables

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A threshold of ln n for approximating set cover (preliminary version)

STOC '96 Proceedings of the twenty-eighth annual ACM symposium on Theory of computing
Range queries in OLAP data cubes

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Dynamic itemset counting and implication rules for market basket data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Association rules over interval data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Advances in knowledge discovery and data mining

Advances in knowledge discovery and data mining
Bayesian classification (AutoClass): theory and results

Advances in knowledge discovery and data mining
Fast discovery of association rules

Advances in knowledge discovery and data mining
A cost model for nearest neighbor search in high-dimensional data space

PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Data mining, hypergraph transversals, and machine learning (extended abstract)

PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Efficiently mining long patterns from databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
A comparative study of clustering methods

Future Generation Computer Systems - Special double issue on data mining
Stochastic Complexity in Statistical Inquiry Theory

Stochastic Complexity in Statistical Inquiry Theory
The Design and Analysis of Computer Algorithms

The Design and Analysis of Computer Algorithms
SLIQ: A Fast Scalable Classifier for Data Mining

EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
Pincer Search: A New Algorithm for Discovering the Maximum Frequent Set

EDBT '98 Proceedings of the 6th International Conference on Extending Database Technology: Advances in Database Technology
Efficient and Effective Clustering Methods for Spatial Data Mining

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Sampling Large Databases for Association Rules

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
SPRINT: A Scalable Parallel Classifier for Data Mining

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases

Efficient algorithms for geometric optimization

ACM Computing Surveys (CSUR)
A framework for measuring changes in data characteristics

PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
OPTICS: ordering points to identify the clustering structure

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Fast algorithms for projected clustering

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Scalable algorithms for mining large databases

KDD '99 Tutorial notes of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering techniques for large data sets—from the past to the future

KDD '99 Tutorial notes of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Entropy-based subspace clustering for mining numerical data

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
CACTUS—clustering categorical data using summaries

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
ACQ: an automatic clustering and querying approach for large image databases

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 2)
A multiple-resolution method for edge-centric data clustering

Proceedings of the eighth international conference on Information and knowledge management
Finding generalized projected clusters in high dimensional spaces

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
LOF: identifying density-based local outliers

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
SQLEM: fast clustering in SQL using the EM algorithm

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Approximation algorithms for projective clustering

SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
Efficient mining of weighted association rules (WAR)

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Feature selection in unsupervised learning via evolutionary search

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Identifying prospective customers

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering through decision tree construction

Proceedings of the ninth international conference on Information and knowledge management
High performance clustering based on the similarity join

Proceedings of the ninth international conference on Information and knowledge management
A cost model for query processing in high dimensional data spaces

ACM Transactions on Database Systems (TODS)
Information retrieval on the web

ACM Computing Surveys (CSUR)
Outlier detection for high dimensional data

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Database techniques for archival of solid models

Proceedings of the sixth ACM symposium on Solid modeling and applications
Systems support for scalable data mining

ACM SIGKDD Explorations Newsletter - Special issue on “Scalable data mining algorithms”
Robust space transformations for distance-based operations

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient discovery of error-tolerant frequent itemsets in high dimensions

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A human-computer cooperative system for effective high dimensional clustering

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Induction of semantic classes from natural language text

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases

ACM Computing Surveys (CSUR)
Towards effective and interpretable data mining by visual interaction

ACM SIGKDD Explorations Newsletter
A new cell-based clustering method for large, high-dimensional data in data mining applications

Proceedings of the 2002 ACM symposium on Applied computing
Loglinear-Based Quasi Cubes

Journal of Intelligent Information Systems
Clustering by pattern similarity in large data sets

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
A Monte Carlo algorithm for fast projective clustering

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
An evaluation of sampling methods for data mining with fuzzy C-means

Data mining for design and manufacturing
An iterative strategy for pattern discovery in high-dimensional data sets

Proceedings of the eleventh international conference on Information and knowledge management
FREM: fast and robust EM clustering for large data sets

Proceedings of the eleventh international conference on Information and knowledge management
Opening the black box: interactive hierarchical clustering for multivariate spatial patterns

Proceedings of the 10th ACM international symposium on Advances in geographic information systems
Hyper-rectangle based segmentation and clustering of large video data sets

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Intelligent multimedia computing and networking
Multi-Level Clustering and its Visualization for Exploratory Spatial Analysis

Geoinformatica
Squeezer: an efficient algorithm for clustering categorical data

Journal of Computer Science and Technology
Clustering High Dimensional Massive Scientific Datasets

Journal of Intelligent Information Systems
DEMON: Mining and Monitoring Evolving Data

IEEE Transactions on Knowledge and Data Engineering
Finding Localized Associations in Market Basket Data

IEEE Transactions on Knowledge and Data Engineering
Redefining Clustering for High-Dimensional Applications

IEEE Transactions on Knowledge and Data Engineering
Clustering for Approximate Similarity Search in High-Dimensional Spaces

IEEE Transactions on Knowledge and Data Engineering
CLARANS: A Method for Clustering Objects for Spatial Data Mining

IEEE Transactions on Knowledge and Data Engineering
An evolutionary technique based on K-means algorithm for optimal clustering in RN

Information Sciences—Applications: An International Journal
DynDex: a dynamic and non-metric space indexer

Proceedings of the tenth ACM international conference on Multimedia
Using Projections to Visually Cluster High-Dimensional Data

Computing in Science and Engineering
Classification Rule Learning with APRIORI-C

EPIA '01 Proceedings of the10th Portuguese Conference on Artificial Intelligence on Progress in Artificial Intelligence, Knowledge Extraction, Multi-agent Systems, Logic Programming and Constraint Solving
Constraint-based clustering in large databases

ICDT '01 Proceedings of the 8th International Conference on Database Theory
Scalable Model for Extensional and Intensional Descriptions of Unclassified Data

IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
An Efficient Approach to Discovering Sequential Patterns in Large Databases

PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Generalized Entropy and Projection Clustering of Categorical Data

PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
A Data Set Oriented Approach for Clustering Algorithm Selection

PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Automatic Construction and Refinement of a Class Hierarchy over Multi-valued Data

PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Multiscale Comparison of Temporal Patternsin Time-Series Medical Databases

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
A Two-Level Method for Clustering DTDs

WAIM '00 Proceedings of the First International Conference on Web-Age Information Management
Using Loglinear Models to Compress Datacube

WAIM '00 Proceedings of the First International Conference on Web-Age Information Management
Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Finding Intensional Knowledge of Distance-Based Outliers

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Semantic Compression and Pattern Extraction with Fascicles

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Local Dimensionality Reduction: A New Approach to Indexing High Dimensional Spaces

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
The 3W Model and Algebra for Unified Data Mining

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
CBCM: A Cell-Based Clustering Method for Data Mining Applications

WAIM '02 Proceedings of the Third International Conference on Advances in Web-Age Information Management
Interactive Clustering for Transaction Data

DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
CoFD: An Algorithm for Non-distance Based Clustering in High Dimensional Spaces

DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
A Hierarchical Model to Support Kansei Mining Process

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Mining N-most Interesting Itemsets

ISMIS '00 Proceedings of the 12th International Symposium on Foundations of Intelligent Systems
Rough Clustering: An Alternative to Find Meaningful Clusters by Using the Reducts from a Dataset

TSCTC '02 Proceedings of the Third International Conference on Rough Sets and Current Trends in Computing
Feature Selection for Clustering

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
A Visual Method of Cluster Validation with Fastmap

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
Scalable Hierarchical Clustering Method for Sequences of Categorical Values

PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
M-FastMap: A Modified FastMap Algorithm for Visual Cluster Validation in Data Mining

PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
On Data Clustering Analysis: Scalability, Constraints, and Validation

PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Subspace Clustering Based on Compressibility

DS '02 Proceedings of the 5th International Conference on Discovery Science
Analyzing Data Clusters: A Rough Sets Approach to Extract Cluster-Defining Symbolic Rules

IDA '01 Proceedings of the 4th International Conference on Advances in Intelligent Data Analysis
A Generalization-Based Approach to Clustering of Web Usage Sessions

WEBKDD '99 Revised Papers from the International Workshop on Web Usage Analysis and User Profiling
DROLAP - A Dense-Region Based Approach to On-Line Analytical Processing

DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
RecTree: An Efficient Collaborative Filtering Method

DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
Finding Dense Clusters in Hyperspace: An Approach Based on Row Shuffling

WAIM '01 Proceedings of the Second International Conference on Advances in Web-Age Information Management
WaveCluster: a wavelet-based clustering approach for spatial data in very large databases

The VLDB Journal — The International Journal on Very Large Data Bases
Clustering DTDs: an interactive two-level approach

Journal of Computer Science and Technology
A survey on wavelet applications in data mining

ACM SIGKDD Explorations Newsletter
Concise descriptions of subsets of structured sets

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
CLOPE: a fast and effective clustering algorithm for transactional data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering Data Streams: Theory and Practice

IEEE Transactions on Knowledge and Data Engineering
P-AutoClass: Scalable Parallel Clustering for Mining Large Data Sets

IEEE Transactions on Knowledge and Data Engineering
Approximation algorithms for projective clustering

Journal of Algorithms
Data mining for hypertext: a tutorial survey

ACM SIGKDD Explorations Newsletter
A Scalable Parallel Subspace Clustering Algorithm for Massive Data Sets

ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Efficiently Detecting Arbitrary Shaped Clusters in Image Databases

ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Clustering in very large databases based on distance and density

Journal of Computer Science and Technology
ICEAGE: Interactive Clustering and Exploration of Large and High-Dimensional Geodata

Geoinformatica
Clustering binary data streams with K-means

DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Clustering gene expression data in SQL using locally adaptive metrics

DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Feature selection in data mining

Data mining
Efficient Biased Sampling for Approximate Clustering and Outlier Detection in Large Data Sets

IEEE Transactions on Knowledge and Data Engineering
Conceptual Clustering of Heterogeneous GeneExpression Sequences

Artificial Intelligence Review
Clustering intrusion detection alarms to support root cause analysis

ACM Transactions on Information and System Security (TISSEC)
Analyzing High-Dimensional Data by Subspace Validity

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Frequent-Pattern based Iterative Projected Clustering

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
OP-Cluster: Clustering by Tendency in High Dimensional Space

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
MaPle: A Fast Algorithm for Maximal Pattern-based Clustering

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Mining phenotypes and informative genes from gene expression data

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining multiple phenotype structures underlying gene expression profiles

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Maintaining discovered frequent itemsets: cases for changeable database and support

Journal of Computer Science and Technology
A Monotonic On-Line Linear Algorithm for Hierarchical Agglomerative Classification

Information Technology and Management
Detecting pattern-based outliers

Pattern Recognition Letters
A Human-Computer Interactive Method for Projected Clustering

IEEE Transactions on Knowledge and Data Engineering
Coordinating computational and visual approaches for interactive feature selection and multivariate clustering

Information Visualization - Special issue on coordinated and multiple views in exploratory visualization
Outlier analysis for gene expression data

Journal of Computer Science and Technology - Special issue on bioinformatics
Hypergraph Models and Algorithms for Data-Pattern-Based Clustering

Data Mining and Knowledge Discovery
Space-efficient cubes for OLAP range-sum queries

Decision Support Systems
Segmenting motion capture data into distinct behaviors

GI '04 Proceedings of the 2004 Graphics Interface Conference
Computing Clusters of Correlation Connected objects

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Subspace clustering for high dimensional data: a review

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
A New Conceptual Clustering Framework

Machine Learning
Document clustering via adaptive subspace iteration

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Efficient Disk-Based K-Means Clustering for Relational Databases

IEEE Transactions on Knowledge and Data Engineering
Rapid detection of significant spatial clusters

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
On detecting space-time clusters

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering moving objects

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for ontology-driven subspace clustering

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Sleeved coclustering

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Extracting predicates from mining models for efficient query evaluation

ACM Transactions on Database Systems (TODS)
Feature Selection for Unsupervised Learning

The Journal of Machine Learning Research
Mining Frequent Itemsets without Support Threshold: With and without Item Constraints

IEEE Transactions on Knowledge and Data Engineering
Simultaneous Feature Selection and Clustering Using Mixture Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Biclustering Algorithms for Biological Data Analysis: A Survey

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Cluster Analysis for Gene Expression Data: A Survey

IEEE Transactions on Knowledge and Data Engineering
HARP: A Practical Projected Clustering Algorithm

IEEE Transactions on Knowledge and Data Engineering
Clustering High-Dimensional Data with Low-Order Neighbors

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Framework and algorithms for trend analysis in massive temporal data sets

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Automatic image annotation and retrieval using subspace clustering algorithm

Proceedings of the 2nd ACM international workshop on Multimedia databases
Iterative Projected Clustering by Subspace Mining

IEEE Transactions on Knowledge and Data Engineering
Identifying projected clusters from gene expression profiles

Journal of Biomedical Informatics
Subspace clustering for high dimensional categorical data

ACM SIGKDD Explorations Newsletter
Projective Clustering by Histograms

IEEE Transactions on Knowledge and Data Engineering
Antipole Tree Indexing to Support Range Search and K-Nearest Neighbor Search in Metric Spaces

IEEE Transactions on Knowledge and Data Engineering
Automated Variable Weighting in k-Means Type Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
On Discovery of Extremely Low-Dimensional Clusters Using Semi-Supervised Projected Clustering

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
k-means projective clustering

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
An effective and efficient algorithm for high-dimensional outlier detection

The VLDB Journal — The International Journal on Very Large Data Bases
Concise descriptions of subsets of structured sets

ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2003
CURLER: finding and visualizing nonlinear correlation clusters

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Automatic Subspace Clustering of High Dimensional Data

Data Mining and Knowledge Discovery
GCHL: A grid-clustering algorithm for high-dimensional very large spatial data bases

Pattern Recognition Letters
Dimension induced clustering

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Feature bagging for outlier detection

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
A general model for clustering binary data

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Detection of emerging space-time clusters

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
CLICKS: an effective algorithm for mining subspace clusters in categorical datasets

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Unsupervised anomaly detection in network intrusion detection using clusters

ACSC '05 Proceedings of the Twenty-eighth Australasian conference on Computer Science - Volume 38
A Shrinking-Based Clustering Approach for Multidimensional Data

IEEE Transactions on Knowledge and Data Engineering
Knowledge discovery by probabilistic clustering of distributed databases

Data & Knowledge Engineering
Clustering Ensembles: Models of Consensus and Weak Partitions

IEEE Transactions on Pattern Analysis and Machine Intelligence
Clustering high-dimensional data using an efficient and effective data space reduction

Proceedings of the 14th ACM international conference on Information and knowledge management
A rank-by-feature framework for interactive exploration of multidimensional data

Information Visualization
Finding Frequent Patterns in a Large Sparse Graph*

Data Mining and Knowledge Discovery
A Generic Framework for Efficient Subspace Clustering of High-Dimensional Data

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A Levelwise Search Algorithm for Interesting Subspace Clusters

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Categorization and Keyword Identification of Unlabeled Documents

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Mining Quantitative Frequent Itemsets Using Adaptive Density-Based Subspace Clustering

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Matrix approximation and projective clustering via volume sampling

SODA '06 Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm
Knowledge map creation and maintenance for virtual communities of practice

Information Processing and Management: an International Journal
Dynamic Cluster Formation Using Level Set Methods

IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic image annotation and retrieval using weighted feature selection

Multimedia Tools and Applications
MicroCluster: Efficient Deterministic Biclustering of Microarray Data

IEEE Intelligent Systems
Evolving Feature Selection

IEEE Intelligent Systems
Comparing Subspace Clusterings

IEEE Transactions on Knowledge and Data Engineering
On the use of Human-Computer Interaction for Projected Nearest Neighbor Search

Data Mining and Knowledge Discovery
Graph-based synopses for relational selectivity estimation

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
PENS: an algorithm for density-based clustering in peer-to-peer systems

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Deriving quantitative models for correlation clusters

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Robust information-theoretic clustering

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Discovering significant OPSM subspace clusters in massive gene expression data

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Projective clustering using itemset discovery for multi-dimensional data analysis

MS'06 Proceedings of the 17th IASTED international conference on Modelling and simulation
Adaptive non-linear clustering in data streams

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
On subspace clustering with density consciousness

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Physical Database Design: the database professional's guide to exploiting indexes, views, storage, and more

Physical Database Design: the database professional's guide to exploiting indexes, views, storage, and more
Clicks: An effective algorithm for mining subspace clusters in categorical datasets

Data & Knowledge Engineering
Gradual model generator for single-pass clustering

Pattern Recognition
Finding biclusters by random projections

Theoretical Computer Science
Cell-nuclear data reduction and prognostic model selection in bladder tumor recurrence

Artificial Intelligence in Medicine
A parallel hierarchical clustering algorithm for PCs cluster system

Neurocomputing
Locally adaptive metrics for clustering high dimensional data

Data Mining and Knowledge Discovery
NOCEA: A rule-based evolutionary algorithm for efficient and effective clustering of massive high-dimensional databases

Applied Soft Computing
A threshold criterion, auto-detection and its use in MST-based clustering

Intelligent Data Analysis
Bi-criteria linear-time approximations for generalized k-mean/median/center

SCG '07 Proceedings of the twenty-third annual symposium on Computational geometry
Building statistical models and scoring with UDFs

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Linear manifold clustering in high dimensional spaces by stochastic search

Pattern Recognition
Quality-Aware Sampling and Its Applications in Incremental Data Mining

IEEE Transactions on Knowledge and Data Engineering
Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing

Knowledge and Information Systems
Toward Exploratory Test-Instance-Centered Diagnosis in High-Dimensional Classification

IEEE Transactions on Knowledge and Data Engineering
An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data

IEEE Transactions on Knowledge and Data Engineering
A local-density based spatial clustering algorithm with noise

Information Systems
A new data clustering approach: Generalized cellular automata

Information Systems
Finding low-entropy sets and trees from binary data

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Enhancing semi-supervised clustering: a feature projection perspective

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
The generalized MDL approach for summarization

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
A trimmed mean approach to finding spatial outliers

Intelligent Data Analysis
Algorithms for clustering high dimensional and distributed data

Intelligent Data Analysis
Evolutionary model selection in unsupervised learning

Intelligent Data Analysis
Mining association rules using clustering

Intelligent Data Analysis
RIC: Parameter-free noise-robust clustering

ACM Transactions on Knowledge Discovery from Data (TKDD)
Top-Down Parameter-Free Clustering of High-Dimensional Categorical Data

IEEE Transactions on Knowledge and Data Engineering
Detecting eye fixations by projection clustering

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
A neural-network-based approach to detecting rectangular objects

Neurocomputing
A shrinking-based approach for multi-dimensional data analysis

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A framework for projected clustering of high dimensional data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Grid-based subspace clustering over data streams

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Learning correlations using the mixture-of-subsets model

ACM Transactions on Knowledge Discovery from Data (TKDD)
A clustering framework based on subjective and objective validity criteria

ACM Transactions on Knowledge Discovery from Data (TKDD)
A genetic approach for efficient outlier detection in projected space

Pattern Recognition
Continuous subspace clustering in streaming time series

Information Systems
Mining approximate top-k subspace anomalies in multi-dimensional time-series data

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Random walk biclustering for microarray data

Information Sciences: an International Journal
Biclustering in data mining

Computers and Operations Research
A convergence theorem for the fuzzy subspace clustering (FSC) algorithm

Pattern Recognition
Automatic kernel clustering with a Multi-Elitist Particle Swarm Optimization Algorithm

Pattern Recognition Letters
A Novel Biologically and Psychologically Inspired Fuzzy Decision Support System: Hierarchical Complementary Learning

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
VISA: visual subspace clustering analysis

ACM SIGKDD Explorations Newsletter - Special issue on visual analytics
An adaptable deflect and conquer clustering algorithm

ACOS'07 Proceedings of the 6th Conference on WSEAS International Conference on Applied Computer Science - Volume 6
Dynamic pattern discovery using multi-agent technology

TELE-INFO'07 Proceedings of the 6th WSEAS Int. Conference on Telecommunications and Informatics
An adaptive crossover-imaged clustering algorithm

SMO'07 Proceedings of the 7th WSEAS International Conference on Simulation, Modelling and Optimization
SCHISM: a new approach to interesting subspace mining

International Journal of Business Intelligence and Data Mining
A hierarchical model-based approach to co-clustering high-dimensional data

Proceedings of the 2008 ACM symposium on Applied computing
Clustering techniques utilized in web usage mining

AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
A general grid-clustering approach

Pattern Recognition Letters
Mining multiple-level fuzzy blocks from multidimensional data

Fuzzy Sets and Systems
A new approach for evaluating agility in supply chains using Fuzzy Association Rules Mining

Engineering Applications of Artificial Intelligence
A classification method based on subspace clustering and association rules

New Generation Computing
Outlier-robust clustering using independent components

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Automatic clustering and boundary detection algorithm based on adaptive influence function

Pattern Recognition
SS-ClusterTree: a subspace clustering based indexing algorithm over high-dimensional image features

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Mining typical patterns from databases

Information Sciences: an International Journal
Finding non-redundant, statistically significant regions in high dimensional data: a novel approach to projected and subspace clustering

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Succinct summarization of transactional databases: an overlapped hyperrectangle scheme

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Morpheus: interactive exploration of subspace clustering

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Higher order mining

ACM SIGKDD Explorations Newsletter
Optimal subspace dimensionality for k-nearest-neighbor queries on clustered and dimensionality reduced datasets with SVD

Multimedia Tools and Applications
Locally Scaled Density Based Clustering

ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
High-Dimensional Clustering Method for High Performance Data Mining

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
Data Set Homeomorphism Transformation Based Meta-clustering

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
The Study of Dynamic Aggregation of Relational Attributes on Relational Data Mining

ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Flexible Grid-Based Clustering

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Unsupervised Anomaly Detection Using HDG-Clustering Algorithm

Neural Information Processing
ELKI: A Software System for Evaluation of Subspace Clustering Algorithms

SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
A Multi-Objective Multipopulation Approach for Biclustering

ICARIS '08 Proceedings of the 7th international conference on Artificial Immune Systems
Continuous Clustering of Moving Objects in Spatial Networks

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Detecting Current Outliers: Continuous Outlier Detection over Time-Series Data Streams

DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
Clustering Distributed Sensor Data Streams

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Pleiades: Subspace Clustering and Evaluation

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Clustering high dimensional data: A graph-based relaxed optimization approach

Information Sciences: an International Journal
Memory efficient subspace clustering for online data streams

IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Detecting clusters in moderate-to-high dimensional data: subspace clustering, pattern-based clustering, and correlation clustering

Proceedings of the VLDB Endowment
An entropy clustering analysis based on genetic algorithm

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Fuzzy theory and technology with applications
An axis-shifted crossover-imaged clustering algorithm

WSEAS TRANSACTIONS on SYSTEMS
A deflected grid-based algorithm for clustering analysis

WSEAS Transactions on Computers
EDSC: efficient density-based subspace clustering

Proceedings of the 17th ACM conference on Information and knowledge management
Analyzing eye fixations and gaze orientations on films and pictures

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Incremental clustering of dynamic data streams using connectivity based representative points

Data & Knowledge Engineering
Image-mapped data clustering: An efficient technique for clustering large data sets

Intelligent Data Analysis
NPClu: An approach for clustering spatially extended objects

Intelligent Data Analysis
Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

ACM Transactions on Knowledge Discovery from Data (TKDD)
Multifractal-based cluster hierarchy optimisation algorithm

International Journal of Business Intelligence and Data Mining
Constraint-based clustering and its applications in construction management

Expert Systems with Applications: An International Journal
Robust clustering analysis for the management of self-monitoring distributed systems

Cluster Computing
TuG synopses for approximate query answering

ACM Transactions on Database Systems (TODS)
Efficiently tracing clusters over high-dimensional on-line data streams

Data & Knowledge Engineering
A distance-relatedness dynamic model for clustering high dimensional data of arbitrary shapes and densities

Pattern Recognition
Efficient layered density-based clustering of categorical data

Journal of Biomedical Informatics
Deriving strong association mining rules using a dependency criterion, the lift measure

International Journal of Data Analysis Techniques and Strategies
DECODE: a new method for discovering clusters of different densities in spatial data

Data Mining and Knowledge Discovery
SLICE: A Novel Method to Find Local Linear Correlations by Constructing Hyperplanes

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Clustering by pattern similarity

Journal of Computer Science and Technology
PFHC: A clustering algorithm based on data partitioning for unevenly distributed datasets

Fuzzy Sets and Systems
Top-k typicality queries and efficient query answering methods on large databases

The VLDB Journal — The International Journal on Very Large Data Bases
Parametric and nonparametric evolutionary computing with a content-based feature selection approach for parallel categorization

Expert Systems with Applications: An International Journal
CP-summary: a concise representation for browsing frequent itemsets

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Query result clustering for object-level search

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
A semi-supervised approach to projected clustering with applications to microarray data

International Journal of Data Mining and Bioinformatics
SDCC: A New Stable Double-Centroid Clustering Technique Based on K-Means for Non-spherical Patterns

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
HSM: Heterogeneous Subspace Mining in High Dimensional Data

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
A Bipartite Graph Framework for Summarizing High-Dimensional Binary, Categorical and Numeric Data

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Application of Clustering Techniques in a Network Security Testing System

Proceedings of the 2005 conference on Artificial Intelligence Research and Development
Study on Rough set-based Clustering Result Presentation Method for High Dimensional Data Space

Proceedings of the 2006 conference on Leading the Web in Concurrent Engineering: Next Generation Concurrent Engineering
Subspace sums for extracting non-random data from massive noise

Knowledge and Information Systems
NPUST: An Efficient Clustering Algorithm Using Partition Space Technique for Large Databases

IEA/AIE '09 Proceedings of the 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: Next-Generation Applied Intelligence
FARM: a new efficient and effective data clustering algorithm

MUSP'09 Proceedings of the 9th WSEAS international conference on Multimedia systems & signal processing
A Framework for Trajectory Clustering

GSN '09 Proceedings of the 3rd International Conference on GeoSensor Networks
SubCOID: an attempt to explore cluster-outlier iterative detection approach to multi-dimensional data analysis in subspace

Proceedings of the 46th Annual Southeast Regional Conference on XX
FuzzyShrinking: improving shrinking-based data mining algorithms using fuzzy concept for multi-dimensional data

Proceedings of the 46th Annual Southeast Regional Conference on XX
Data spread-based entropy clustering method using adaptive learning

Expert Systems with Applications: An International Journal
Efficient Clustering of Web-Derived Data Sets

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
A comprehensive survey of numeric and symbolic outlier mining techniques

Intelligent Data Analysis
K-Subspace Clustering

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
History Guided Low-Cost Change Detection in Streams

DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Discovering pattern-based subspace clusters by pattern tree

Knowledge-Based Systems
Fragmenting very large XML data warehouses via K-means clustering algorithm

International Journal of Business Intelligence and Data Mining
Density-based clustering using graphics processors

Proceedings of the 18th ACM conference on Information and knowledge management
Rank-aware clustering of structured datasets

Proceedings of the 18th ACM conference on Information and knowledge management
Interpretable and reconfigurable clustering of document datasets by deriving word-based rules

Proceedings of the 18th ACM conference on Information and knowledge management
Generalized fuzzy C-means clustering algorithm with improved fuzzy partitions

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Enhanced soft subspace clustering integrating within-cluster and between-cluster information

Pattern Recognition
An efficient algorithm for finding dense regions for mining quantitative association rules

Computers & Mathematics with Applications
Parallel clustering of high dimensional data by integrating multi-objective genetic algorithm with divide and conquer

Applied Intelligence
Subspace and projected clustering: experimental evaluation and analysis

Knowledge and Information Systems
EIDBSCAN: An Extended Improving DBSCAN algorithm with sampling techniques

International Journal of Business Intelligence and Data Mining
Using trees to depict a forest

Proceedings of the VLDB Endowment
Graph clustering based on structural/attribute similarities

Proceedings of the VLDB Endowment
Evaluating clustering in subspace projections of high dimensional data

Proceedings of the VLDB Endowment
Improving a multi-objective multipopulation artificial immune network for biclustering

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Projected Gustafson Kessel Clustering

RSFDGrC '09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
COP: privacy-preserving multidimensional partition in DAS paradigm

Proceedings of the 2009 EDBT/ICDT Workshops
Data clustering using a modified Kuwahara filter

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Knowledge map creation and maintenance for virtual communities of practice

Information Processing and Management: an International Journal
Text clustering algorithm based on spectral graph seriation

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
A semi-supervised clustering algorithm based on rough reduction

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
ISMCS: an intelligent instruction sequence based malware categorization system

ASID'09 Proceedings of the 3rd international conference on Anti-Counterfeiting, security, and identification in communication
A signal filter based clustering algorithm

WiCOM'09 Proceedings of the 5th International Conference on Wireless communications, networking and mobile computing
Optimization on Lie manifolds and pattern recognition

Pattern Recognition
SKM-SNP: SNP markers detection method

Journal of Biomedical Informatics
Anomaly intrusion detection by clustering transactional audit streams in a host computer

Information Sciences: an International Journal
Data clustering: 50 years beyond K-means

Pattern Recognition Letters
Mining comprehensible clustering rules with an evolutionary algorithm

GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
An applicable hierarchical clustering algorithm for content-based image retrieval

MIRAGE'07 Proceedings of the 3rd international conference on Computer vision/computer graphics collaboration techniques
AGRID: an efficient algorithm for clustering large high-dimensional datasets

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
An efficient cell-based clustering method for handling large, high-dimensional data

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Hierarchical density-based clustering of categorical data and a simplification

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
A clustering algorithm based on mechanics

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
A fast algorithm for finding correlation clusters in noise data

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
BRIM: an efficient boundary points detecting algorithm

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
ANGEL: a new effective and efficient hybrid clustering technique for large databases

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
A continuous-based approach for partial clique enumeration

GbRPR'07 Proceedings of the 6th IAPR-TC-15 international conference on Graph-based representations in pattern recognition
Mining time-shifting co-regulation patterns from gene expression data

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Clustering by random projections

ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
Automatic extraction of business rules to improve quality in planning and consolidation in transport logistics based on multi-agent clustering

AIS-ADM'07 Proceedings of the 2nd international conference on Autonomous intelligent systems: agents and data mining
Robust Algebraic Segmentation of Mixed Rigid-Body and Planar Motions from Two Views

International Journal of Computer Vision
Conditionals in nonmonotonic reasoning and belief revision: considering conditionals as agents

Conditionals in nonmonotonic reasoning and belief revision: considering conditionals as agents
Applying biclustering to text mining: an immune-inspired approach

ICARIS'07 Proceedings of the 6th international conference on Artificial immune systems
Discretization numbers for multiple-instances problem in relational database

ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Grid-based clustering algorithm based on intersecting partition and density estimation

PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
DBSC: a dependency-based subspace clustering algorithm for high dimensional numerical datasets

AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
Maximum item first pattern growth for mining frequent patterns

RSFDGrC'03 Proceedings of the 9th international conference on Rough sets, fuzzy sets, data mining, and granular computing
An adaptive and efficient unsupervised shot clustering algorithm for sports video

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Detection and visualization of subspace cluster hierarchies

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Clustering moving objects in spatial networks

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
DGDCT: a distributed grid-density based algorithm for intrinsic cluster detection over massive spatial data

ICDCN'08 Proceedings of the 9th international conference on Distributed computing and networking
SubClass: classification of multidimensional noisy data using subspace clusters

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
G-TREACLE: a new grid-based and tree-alike pattern clustering technique for large databases

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
A creditable subspace labeling method based on D-S evidence theory

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Quantization-based clustering algorithm

Pattern Recognition
Word clustering with validity indices

Canadian AI'08 Proceedings of the Canadian Society for computational studies of intelligence, 21st conference on Advances in artificial intelligence
HDG-tree: a structure for clustering high-dimensional data streams

IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
Clustering high dimensional data streams with representative points

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Genetic algorithm-based high-dimensional data clustering technique

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Mining representative subspace clusters in high-dimensional data

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Model-based subspace clustering of non-Gaussian data

Neurocomputing
Uniqueness mining

DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Distance based feature selection for clustering microarray data

DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Scalable Clustering for Mining Local-Correlated Clusters in High Dimensions and Large Datasets

Fundamenta Informaticae - Intelligent Data Analysis in Granular Computing
Mining Outliers in Correlated Subspaces for High Dimensional Data Sets

Fundamenta Informaticae - Intelligent Data Analysis in Granular Computing
Autonomic policy adaptation using decentralized online clustering

Proceedings of the 7th international conference on Autonomic computing
Netcluster: a clustering-based framework for internet tomography

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
An estimation of distribution algorithm for the automatic generation of clustering algorithms

Proceedings of the 12th annual conference on Genetic and evolutionary computation
Learning in parallel universes

Data Mining and Knowledge Discovery
Towards subspace clustering on dynamic data: an incremental version of PreDeCon

Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Clustering by synchronization

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Mixture models for learning low-dimensional roles in high-dimensional data

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning multiple nonredundant clusterings

ACM Transactions on Knowledge Discovery from Data (TKDD)
Outlier detection in transactional data

Intelligent Data Analysis
Enhancing effectiveness of density-based outlier mining scheme with density-similarity-neighbor-based outlier factor

Expert Systems with Applications: An International Journal
Query expansion using an immune-inspired biclustering algorithm

Natural Computing: an international journal
Density-based semi-supervised clustering

Data Mining and Knowledge Discovery
PHD: an efficient data clustering scheme using partition space technique for knowledge discovery in large databases

Applied Intelligence
Automatic parameter determination in subspace clustering with gravitation function

Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Selection of effective network parameters in attacks for intrusion detection

ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
Specialty mining

DaWaK'10 Proceedings of the 12th international conference on Data warehousing and knowledge discovery
Metric and trigonometric pruning for clustering of uncertain data in 2D geometric space

Information Systems
A Density-based Hierarchical Clustering Algorithm for Highly Overlapped Distributions with Noisy Points

Proceedings of the 2010 conference on Artificial Intelligence Research and Development: Proceedings of the 13th International Conference of the Catalan Association for Artificial Intelligence
Distributed antipole clustering for efficient data search and management in Euclidean and metric spaces

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Mining group-based knowledge flows for sharing task knowledge

Decision Support Systems
Mining relaxed closed subspace clusters

Proceedings of the 48th Annual Southeast Regional Conference
Towards improving subspace data analysis

Proceedings of the 48th Annual Southeast Regional Conference
Inter-dimensional fuzzy clustering

Proceedings of the 48th Annual Southeast Regional Conference
Clustering Large Attributed Graphs: A Balance between Structural and Attribute Similarities

ACM Transactions on Knowledge Discovery from Data (TKDD)
Clustering distributed sensor data streams using local processing and reduced communication

Intelligent Data Analysis - Ubiquitous Knowledge Discovery
Exploring high-D spaces with multiform matrices and small multiples

INFOVIS'03 Proceedings of the Ninth annual IEEE conference on Information visualization
DenVOICE: a new density-partitioning clustering technique based on congregation of dense voronoi cells for non-spherical patterns

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume PartI
Subspace clustering for indexing high dimensional data: a main memory index based on local reductions and individual multi-representations

Proceedings of the 14th International Conference on Extending Database Technology
Making interval-based clustering rank-aware

Proceedings of the 14th International Conference on Extending Database Technology
A k-means type clustering algorithm for subspace clustering of mixed numeric and categorical datasets

Pattern Recognition Letters
APSCAN: A parameter free algorithm for clustering

Pattern Recognition Letters
Fast outlier detection for very large log data

Expert Systems with Applications: An International Journal
An entropy weighting mixture model for subspace clustering of high-dimensional data

Pattern Recognition Letters
Active learning and subspace clustering for anomaly detection

Intelligent Data Analysis
A survey on clustering in data mining

Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Visualization technology for the financial decision models of DSS

ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Systematic analysis of OCS testing data

IMCAS'06 Proceedings of the 5th WSEAS international conference on Instrumentation, measurement, circuits and systems
A DGC-based data classification method used for abnormal network intrusion detection

ICONIP'06 Proceedings of the 13th international conference on Neural information processing - Volume Part III
Minimum spanning tree based split-and-merge: A hierarchical clustering method

Information Sciences: an International Journal
A subspace decision cluster classifier for text classification

Expert Systems with Applications: An International Journal
Summarizing transactional databases with overlapped hyperrectangles

Data Mining and Knowledge Discovery
A novel attribute weighting algorithm for clustering high-dimensional categorical data

Pattern Recognition
Enhancing grid-density based clustering for high dimensional data

Journal of Systems and Software
Projected Gustafson-Kessel clustering algorithm and its convergence

Transactions on rough sets XIV
DisClus: a distributed clustering technique over high resolution satellite data

ICDCN'10 Proceedings of the 11th international conference on Distributed computing and networking
CHIRP: a new classifier based on composite hypercubes on iterated random projections

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering very large multi-dimensional datasets with MapReduce

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Agent-based subspace clustering

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Tracing evolving clusters by subspace and value similarity

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
An extension of the PMML standard to subspace clustering models

Proceedings of the 2011 workshop on Predictive markup language modeling
Spatial clustering to uncluttering map visualization in SOLAP

ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part I
A feature group weighting method for subspace clustering of high-dimensional data

Pattern Recognition
Efficient selectivity estimation by histogram construction based on subspace clustering

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Density based subspace clustering over dynamic data

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Exploratory hierarchical clustering for management zone delineation in precision agriculture

ICDM'11 Proceedings of the 11th international conference on Advances in data mining: applications and theoretical aspects
DB-CSC: a density-based approach for subspace clustering in graphs with feature vectors

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Synthesizing routes for low sampling trajectories with absorbing Markov chains

WAIM'11 Proceedings of the 12th international conference on Web-age information management
A graph model for mutual information based clustering

Journal of Intelligent Information Systems
EEW-SC: Enhanced Entropy-Weighting Subspace Clustering for high dimensional gene expression data clustering analysis

Applied Soft Computing
A clustering algorithm for multiple data streams based on spectral component similarity

Information Sciences: an International Journal
Enhancing Community Discovery and Characterization in VCoP Using Topic Models

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Scalable density-based subspace clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
External evaluation measures for subspace clustering

Proceedings of the 20th ACM international conference on Information and knowledge management
An equity-based and cell-based spatial object fusion method

ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
A grid clustering algorithm based on reference and density

ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
Supervised learning in parallel universes using neighborgrams

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Fast mining erasable itemsets using NC_sets

Expert Systems with Applications: An International Journal
Anomaly intrusion detection based on clustering a data stream

ISC'06 Proceedings of the 9th international conference on Information Security
CLINCH: clustering incomplete high-dimensional data for data mining application

APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Indexing text and visual features for WWW images

APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Subspace clustering of microarray data based on domain transformation

VDMB'06 Proceedings of the First international conference on Data Mining and Bioinformatics
Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

Statistics and Computing
Finding hierarchies of subspace clusters

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Autonomous visualization

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Hyperspectral data selection from mutual information between image bands

SSPR'06/SPR'06 Proceedings of the 2006 joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
A near-linear algorithm for projective clustering integer points

Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms
An adaptive nearest neighbor classification algorithm for data streams

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Deriving class association rules based on levelwise subspace clustering

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Grid-ODF: detecting outliers effectively and efficiently in large multi-dimensional databases

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
Significance and recovery of block structures in binary matrices with noise

COLT'06 Proceedings of the 19th annual conference on Learning Theory
Research paper recommender systems: a subspace clustering approach

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
On the performance of feature weighting K-means for text subspace clustering

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
A clustering algorithm based absorbing nearest neighbors

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
Adapting k-means algorithm for discovering clusters in subspaces

APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Generalized projected clustering in high-dimensional data streams

APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
SCUBA: scalable cluster-based algorithm for evaluating continuous spatio-temporal queries on moving objects

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
A fuzzy subspace algorithm for clustering high dimensional data

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
OSDM: optimized shape distribution method

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Mining MOUCLAS patterns and jumping MOUCLAS patterns to construct classifiers

Data Mining
Tracing Evolving Subspace Clusters in Temporal Climate Data

Data Mining and Knowledge Discovery
ISIS: a new approach for efficient similarity search in sparse databases

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
A grid-density based technique for finding clusters in satellite image

Pattern Recognition Letters
Mutagenicity risk analysis by using class association rules

JSAI'05 Proceedings of the 2005 international conference on New Frontiers in Artificial Intelligence
Succinct and informative cluster descriptions for document repositories

WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
SSC: statistical subspace clustering

MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Linear manifold clustering

MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
A grid-based clustering algorithm for high-dimensional data streams

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Dynamic cluster formation using level set methods

PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
An incremental data stream clustering algorithm based on dense units detection

PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Intelligent database distribution on a grid using clustering

AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
Towards an ontology-based spatial clustering framework

AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Sub-space clustering, inter-clustering results association & anomaly correlation for unsupervised network anomaly detection

Proceedings of the 7th International Conference on Network and Services Management
A new cell-based clustering method for high-dimensional data mining applications

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
Feature interaction in subspace clustering using the Choquet integral

Pattern Recognition
Unsupervised case memory organization: analysing computational time and soft computing capabilities

ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
On approximation algorithms for data mining applications

Efficient Approximation and Online Algorithms
The class cover problem with boxes

Computational Geometry: Theory and Applications
A grid-based subspace clustering algorithm for high-dimensional data streams

WISE'06 Proceedings of the 7th international conference on Web Information Systems
SDI: shape distribution indicator and its application to find interrelationships between physical activity tests and other medical measures

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
A robust seedless algorithm for correlation clustering

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Clustering in applications with multiple data sources-A mutual subspace clustering approach

Neurocomputing
SpaGRID: a spatial grid framework for high dimensional medical databases

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
Mining temporal patterns in popularity of web items

Information Sciences: an International Journal
Towards an integrated e-mail forensic analysis framework

Digital Investigation: The International Journal of Digital Forensics & Incident Response
Exploiting constraint inconsistence for dimension selection in subspace clustering: A semi-supervised approach

Neurocomputing
Subspace clustering

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Clustering high dimensional data

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Subspace correlation clustering: finding locally correlated dimensions in subspace projections of the data

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Co-occurring cluster mining for damage patterns analysis of a fuel cell

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Finding gene coherent patterns using PATSUB+

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Design and evaluation of decentralized online clustering

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
MapReduce algorithms for big data analysis

Proceedings of the VLDB Endowment
Mining time-gap sequential patterns

IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
Visual spam campaigns analysis using abstract graphs representation

Proceedings of the Ninth International Symposium on Visualization for Cyber Security
Substantial improvements in the set-covering projection classifier CHIRP (composite hypercubes on iterated random projections)

ACM Transactions on Knowledge Discovery from Data (TKDD) - Special Issue on the Best of SIGKDD 2011
Multi-scale decomposition of point process data

Geoinformatica
Document recommendations based on knowledge flows: A hybrid of personalized and group-based approaches

Journal of the American Society for Information Science and Technology
Exclusive and complete clustering of streams

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Continuous adaptive outlier detection on distributed data streams

HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Density based grid clustering partition of the input space for RBF neural network

AICI'12 Proceedings of the 4th international conference on Artificial Intelligence and Computational Intelligence
Post-processing strategies for improving local gene expression pattern analysis

International Journal of Data Mining and Bioinformatics
Visualizing high-dimensional structures by dimension ordering and filtering using subspace analysis

EuroVis'11 Proceedings of the 13th Eurographics / IEEE - VGTC conference on Visualization
Fuzzy c-means improvement using relaxed constraints support vector machines

Applied Soft Computing
Enhancing density-based clustering: Parameter reduction and outlier detection

Information Systems
Iterative evolutionary subspace clustering

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
A survey on enhanced subspace clustering

Data Mining and Knowledge Discovery
On the equivalence of PLSI and projected clustering

ACM SIGMOD Record
Projective clustering ensembles

Data Mining and Knowledge Discovery
ASCCN: Arbitrary Shaped Clustering Method with Compatible Nucleoids

International Journal of Data Warehousing and Mining
Spatial Clustering in SOLAP Systems to Enhance Map Visualization

International Journal of Data Warehousing and Mining
A weighting k-modes algorithm for subspace clustering of categorical data

Neurocomputing
Using Multidimensional Clustering Based Collaborative Filtering Approach Improving Recommendation Diversity

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Novel soft subspace clustering with multi-objective evolutionary approach for high-dimensional data

Pattern Recognition
Identifying hidden geospatial resources in catalogues

Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
Warped K-Means: An algorithm to cluster sequentially-distributed data

Information Sciences: an International Journal
Learning a subspace for clustering via pattern shrinking

Information Processing and Management: an International Journal
Fuzzy partition based soft subspace clustering and its applications in high dimensional data

Information Sciences: an International Journal
Clustering based on a near neighbor graph and a grid cell graph

Journal of Intelligent Information Systems
A clustering ensemble framework based on elite selection of weighted clusters

Advances in Data Analysis and Classification
How to "alternatize" a clustering algorithm

Data Mining and Knowledge Discovery
Automatic player behavior analysis system using trajectory data in a massive multiplayer online game

Multimedia Tools and Applications
GPUMAFIA: efficient subspace clustering with MAFIA on GPUs

Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Mining order-preserving submatrices from probabilistic matrices

ACM Transactions on Database Systems (TODS)
Finding multiple global linear correlations in sparse and noisy data sets

Knowledge-Based Systems
NetCluster: A clustering-based framework to analyze internet passive measurements data

Computer Networks: The International Journal of Computer and Telecommunications Networking
Hybrid entity clustering using crowds and data

The VLDB Journal — The International Journal on Very Large Data Bases
MEI: An efficient algorithm for mining erasable itemsets

Engineering Applications of Artificial Intelligence
Model-based clustering of high-dimensional data: A review

Computational Statistics & Data Analysis
Mining non-redundant time-gap sequential patterns

Applied Intelligence
Survey of Clustering: Algorithms and Applications

International Journal of Information Retrieval Research
Automatic identification of application I/O signatures from noisy server-side traces

FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
A re-coloring approach for graph b-coloring based clustering

International Journal of Knowledge-based and Intelligent Engineering Systems
A multivariate fuzzy system applied for outliers detection

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Semi-supervised projected model-based clustering

Data Mining and Knowledge Discovery
Subspace clustering of high-dimensional data: an evolutionary approach

Applied Computational Intelligence and Soft Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

Data mining applications place special requirements on clustering algorithms including: the ability to find clusters embedded in subspaces of high dimensional data, scalability, end-user comprehensibility of the results, non-presumption of any canonical data distribution, and insensitivity to the order of input records. We present CLIQUE, a clustering algorithm that satisfies each of these requirements. CLIQUE identifies dense clusters in subspaces of maximum dimensionality. It generates cluster descriptions in the form of DNF expressions that are minimized for ease of comprehension. It produces identical results irrespective of the order in which input records are presented and does not presume any specific mathematical form for data distribution. Through experiments, we show that CLIQUE efficiently finds accurate cluster in large high dimensional datasets.