Pattern Recognition with Fuzzy Objective Function Algorithms
Pattern Recognition with Fuzzy Objective Function Algorithms
Pattern Recognition Letters
Refining Initial Points for K-Means Clustering
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Biclustering of Expression Data
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Improving clustering stability with combinatorial MRFs
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Nonnegative Decompositions with Resampling for Improving Gene Expression Data Biclustering Stability
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Stable biclustering of gene expression data with nonnegative matrix factorizations
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Generalized clustergrams for overlapping biclusters
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Stability-based validation of bicluster solutions
Pattern Recognition
Hi-index | 0.00 |
Although very widely used in unsupervised data mining, most clustering methods are affected by the instability of the resulting clusters w.r.t. the initialization of the algorithm (as e.g. in k-means). Here we show that this problem can be elegantly and efficiently tackled by meta-clustering the clusters produced in several different runs of the algorithm, especially if “soft” clustering algorithms (such as Nonnegative Matrix Factorization) are used both at the object- and the meta-level. The essential difference w.r.t. other meta-clustering approaches consists in the fact that our algorithm detects frequently occurring sub-clusters (rather than complete clusters) in the various runs, which allows it to outperform existing algorithms. Additionally, we show how to perform two-way meta-clustering, i.e. take both object and sample dimensions of clusters simultaneously into account, a feature which is essential e.g. for biclustering gene expression data, but has not been considered before.