A symbolic representation of time series, with implications for streaming algorithms
DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Constructing internet coordinate system based on delay measurement
Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement
Subspace clustering for high dimensional data: a review
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Document clustering via adaptive subspace iteration
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Meanshift Clustering for DNA Microarray Analysis
CSB '04 Proceedings of the 2004 IEEE Computational Systems Bioinformatics Conference
Constructing internet coordinate system based on delay measurement
IEEE/ACM Transactions on Networking (TON)
A general model for clustering binary data
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Adaptive dimension reduction using discriminant analysis and K-means clustering
Proceedings of the 24th international conference on Machine learning
Experiencing SAX: a novel symbolic representation of time series
Data Mining and Knowledge Discovery
SCHISM: a new approach to interesting subspace mining
International Journal of Business Intelligence and Data Mining
International Journal of Business Intelligence and Data Mining
Efficient layered density-based clustering of categorical data
Journal of Biomedical Informatics
An island model for high-dimensional genomes using phylogenetic speciation and species barcoding
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Subspace maximum margin clustering
Proceedings of the 18th ACM conference on Information and knowledge management
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Clustering algorithms optimizer: a framework for large datasets
ISBRA'07 Proceedings of the 3rd international conference on Bioinformatics research and applications
An adaptive and efficient unsupervised shot clustering algorithm for sports video
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Scalable Clustering for Mining Local-Correlated Clusters in High Dimensions and Large Datasets
Fundamenta Informaticae - Intelligent Data Analysis in Granular Computing
SSPS: A Semi-Supervised Pattern Shift for Classification
Neural Processing Letters
Learning multiple nonredundant clusterings
ACM Transactions on Knowledge Discovery from Data (TKDD)
Image analysis with nonlinear adaptive dimension reduction
Proceedings of the Third International Conference on Internet Multimedia Computing and Service
A probabilistic clustering-projection model for discrete data
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
A robust seedless algorithm for correlation clustering
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Clustering high dimensional data
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Registrar: a complete-memory operator to enhance performance of genetic algorithms
Journal of Global Optimization
A survey on enhanced subspace clustering
Data Mining and Knowledge Discovery
Stock market co-movement assessment using a three-phase clustering method
Expert Systems with Applications: An International Journal
Tensor clustering via adaptive subspace iteration
Intelligent Data Analysis
Hi-index | 0.00 |
It is well-known that for high dimensional data clustering, standard algorithms such as EM and the K -meansare often trapped in local minimum. Many initializationmethods were proposed to tackle this problem, but withonly limited success. In this paper we propose newapproach to resolve this problem by repeated dimension reductions such that K-means or EM are performedonly in very low dimensions.Cluster membership is utilized as a bridge between the reduced dimensional sub-space and the original space, providing flexibility andease of implementation. Clustering analysis performedon highly overlapped Gaussians, DNA gene expressionprofiles and internet newsgroups demonstrate the effectiveness of the proposed algorithm.