Global optimization, meta clustering and consensus clustering for class prediction

Authors:
Ida Bifulco;Carmine Fedullo;Francesco Napolitano;Giancarlo Raiconi;Roberto Tagliaferri
Affiliations:
Dipartimento di Matematica ed Informatica, University of Salerno, Italy;Dipartimento di Matematica ed Informatica, University of Salerno, Italy;Dipartimento di Matematica ed Informatica, University of Salerno, Italy;Dipartimento di Matematica ed Informatica, University of Salerno, Italy;Dipartimento di Matematica ed Informatica, University of Salerno, Italy
Venue:
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Year:
2009

Abstract

Clustering of real-world data is often an ill-posed problem. Because of noise and intrinsic ambiguity in the data, optimization models that attempt to maximize a fitness function can be misled by the assumption that the solution is unique. In this work we present a methodology, combining classic and novel techniques, that approaches clustering in a systematic way, and we illustrate it with two application examples on biological data sets. The methodology is based on a process that generates multiple clustering solutions (using global optimization), performs cluster analysis on those clusterings (i.e., Meta Clustering), and analyzes the obtained clusterings by applying appropriate consensus techniques. To validate the method, we seek the solutions that best match the real class labels, exploiting only a random sample of them. Finally, we infer the class labels of the remaining patterns using cluster enrichment information and verify the percentage of correct assignments for each class. Because it optimizes clustering objective functions while using partial labeling, the described approach lies between unsupervised and semi-supervised methods.
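
The final validation step described in the abstract (inferring the labels of unlabeled patterns from the labeled sample that falls in the same cluster, then scoring each class) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify how cluster enrichment is computed, so the sketch assumes a simple majority vote among the labeled patterns of each cluster. The function names `predict_by_enrichment` and `per_class_accuracy`, and the toy data in the usage note, are hypothetical.

```python
from collections import Counter, defaultdict


def predict_by_enrichment(cluster_ids, known_labels):
    """Predict labels for unlabeled patterns from their cluster's labeled members.

    ASSUMPTION: "enrichment" is approximated by a majority vote; the paper may
    use a different (e.g. statistical) enrichment criterion.

    cluster_ids  : list of cluster indices, one per pattern
    known_labels : dict {pattern index: class label} for the labeled sample
    Returns a dict {pattern index: predicted label, or None if the cluster
    contains no labeled pattern}.
    """
    # Count the known class labels that fall inside each cluster.
    counts = defaultdict(Counter)
    for idx, label in known_labels.items():
        counts[cluster_ids[idx]][label] += 1

    predictions = {}
    for idx, cluster in enumerate(cluster_ids):
        if idx in known_labels:
            continue  # part of the labeled sample, nothing to predict
        if counts[cluster]:
            predictions[idx] = counts[cluster].most_common(1)[0][0]
        else:
            predictions[idx] = None  # no labeled pattern in this cluster
    return predictions


def per_class_accuracy(predictions, true_labels):
    """Fraction of correctly predicted patterns, computed separately per true class."""
    correct, total = Counter(), Counter()
    for idx, predicted in predictions.items():
        total[true_labels[idx]] += 1
        if predicted == true_labels[idx]:
            correct[true_labels[idx]] += 1
    return {cls: correct[cls] / total[cls] for cls in total}
```

As a usage example on made-up data, take six patterns with `cluster_ids = [0, 0, 0, 1, 1, 1]`, true labels `["A", "A", "B", "B", "B", "B"]`, and a labeled sample `{0: "A", 3: "B"}`. Calling `predict_by_enrichment(cluster_ids, {0: "A", 3: "B"})` assigns "A" to patterns 1 and 2 and "B" to patterns 4 and 5, so pattern 2 is misclassified and `per_class_accuracy` reports all of class A and two thirds of class B as correct.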