Adaptive dimension reduction for clustering high dimensional data

Authors:
Chris Ding;Xiaofeng He;Hongyuan Zha;Horst D. Simon
Affiliations:
-;-;-;-
Venue:
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Year:
2002

Citing 0
Cited 30

A symbolic representation of time series, with implications for streaming algorithms

DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Constructing internet coordinate system based on delay measurement

Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement
Subspace clustering for high dimensional data: a review

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Document clustering via adaptive subspace iteration

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Meanshift Clustering for DNA Microarray Analysis

CSB '04 Proceedings of the 2004 IEEE Computational Systems Bioinformatics Conference
Constructing internet coordinate system based on delay measurement

IEEE/ACM Transactions on Networking (TON)
A general model for clustering binary data

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Adaptive dimension reduction using discriminant analysis and K-means clustering

Proceedings of the 24th international conference on Machine learning
Experiencing SAX: a novel symbolic representation of time series

Data Mining and Knowledge Discovery
SCHISM: a new approach to interesting subspace mining

International Journal of Business Intelligence and Data Mining
Unsupervised Topic Detection in document collections: an application in marketing and business journals

International Journal of Business Intelligence and Data Mining
Efficient layered density-based clustering of categorical data

Journal of Biomedical Informatics
An island model for high-dimensional genomes using phylogenetic speciation and species barcoding

Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Subspace maximum margin clustering

Proceedings of the 18th ACM conference on Information and knowledge management
Spectral embedded clustering

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Clustering algorithms optimizer: a framework for large datasets

ISBRA'07 Proceedings of the 3rd international conference on Bioinformatics research and applications
An adaptive and efficient unsupervised shot clustering algorithm for sports video

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Normalized dimensionality reduction using nonnegative matrix factorization

Neurocomputing
Scalable Clustering for Mining Local-Correlated Clusters in High Dimensions and Large Datasets

Fundamenta Informaticae - Intelligent Data Analysis in Granular Computing
2010 Special Issue: Visualization of multi-neuron activity by simultaneous optimization of clustering and dimension reduction

Neural Networks
SSPS: A Semi-Supervised Pattern Shift for Classification

Neural Processing Letters
Learning multiple nonredundant clusterings

ACM Transactions on Knowledge Discovery from Data (TKDD)
Image analysis with nonlinear adaptive dimension reduction

Proceedings of the Third International Conference on Internet Multimedia Computing and Service
A probabilistic clustering-projection model for discrete data

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
A robust seedless algorithm for correlation clustering

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Clustering high dimensional data

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Registrar: a complete-memory operator to enhance performance of genetic algorithms

Journal of Global Optimization
A survey on enhanced subspace clustering

Data Mining and Knowledge Discovery
Stock market co-movement assessment using a three-phase clustering method

Expert Systems with Applications: An International Journal
Tensor clustering via adaptive subspace iteration

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

It is well-known that for high dimensional data clustering, standard algorithms such as EM and the K -meansare often trapped in local minimum. Many initializationmethods were proposed to tackle this problem, but withonly limited success. In this paper we propose newapproach to resolve this problem by repeated dimension reductions such that K-means or EM are performedonly in very low dimensions.Cluster membership is utilized as a bridge between the reduced dimensional sub-space and the original space, providing flexibility andease of implementation. Clustering analysis performedon highly overlapped Gaussians, DNA gene expressionprofiles and internet newsgroups demonstrate the effectiveness of the proposed algorithm.