The bootstrap approach to clustering
Proc. of the NATO Advanced Study Institute on Pattern recognition theory and applications
Algorithms for clustering data
Algorithms for clustering data
Machine Learning
BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
A Data-Clustering Algorithm on Distributed Memory Multiprocessors
Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
Data Resampling for Path Based Clustering
Proceedings of the 24th DAGM Symposium on Pattern Recognition
Path-Based Clustering for Grouping of Smooth Curves and Texture Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Data Clustering Using Evidence Accumulation
ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 4 - Volume 4
Cluster ensembles --- a knowledge reuse framework for combining multiple partitions
The Journal of Machine Learning Research
Combining Multiple Weak Clusterings
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Ensembles of Partitions via Data Resampling
ITCC '04 Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2 - Volume 2
A clustering method based on boosting
Pattern Recognition Letters
Solving cluster ensemble problems by bipartite graph partitioning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Combining Multiple Clusterings Using Evidence Accumulation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Introduction to Data Mining, (First Edition)
Introduction to Data Mining, (First Edition)
Resampling Method for Unsupervised Estimation of Cluster Validity
Neural Computation
A New Approach to Improve the Vote-Based Classifier Selection
NCM '08 Proceedings of the 2008 Fourth International Conference on Networked Computing and Advanced Information Management - Volume 02
NCM '08 Proceedings of the 2008 Fourth International Conference on Networked Computing and Advanced Information Management - Volume 02
CCHR: Combination of Classifiers Using Heuristic Retraining
NCM '08 Proceedings of the 2008 Fourth International Conference on Networked Computing and Advanced Information Management - Volume 02
Divide & Conquer Classification and Optimization by Genetic Algorithm
ICCIT '08 Proceedings of the 2008 Third International Conference on Convergence and Hybrid Information Technology - Volume 02
Neural Network Ensembles Using Clustering Ensemble and Genetic Algorithm
ICCIT '08 Proceedings of the 2008 Third International Conference on Convergence and Hybrid Information Technology - Volume 02
Characterization and evaluation of similarity measures for pairs of clusterings
Knowledge and Information Systems
Ensemble clustering using semidefinite programming with applications
Machine Learning
Using genetic algorithms for data mining optimization in an educational web-based system
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
A new multiobjective clustering technique based on the concepts of stability and symmetry
Knowledge and Information Systems
Coordination of Cluster Ensembles via Exact Methods
IEEE Transactions on Pattern Analysis and Machine Intelligence
Consensus of partitions: a constructive approach
Advances in Data Analysis and Classification
GAC-GEO: a generic agglomerative clustering framework for geo-referenced datasets
Knowledge and Information Systems
Hi-index | 0.00 |
Inspired by bagging and boosting algorithms in classification, the non-weighing and weighing-based sampling approaches for clustering are proposed and studied in the paper. The effectiveness of non-weighing-based sampling technique, comparing the efficacy of sampling with and without replacement, in conjunction with several consensus algorithms have been invested in this paper. Experimental results have shown improved stability and accuracy for clustering structures obtained via bootstrapping, subsampling, and boosting techniques. Subsamples of small size can reduce the computational cost and measurement complexity for many unsupervised data mining tasks with distributed sources of data. This empirical research study also compares the performance of boosting and bagging clustering ensembles using different consensus functions on a number of datasets.