Multiple clustering solutions analysis through least-squares consensus algorithms

  • Authors:
  • Loredana Murino;Claudia Angelini;Ida Bifulco;Italia De Feis;Giancarlo Raiconi;Roberto Tagliaferri

  • Affiliations:
  • NeuRoNe Lab, DMI, Fisciano, SA, Italy;Istituto per le Applicazioni del Calcolo 'Mauro Picone', CNR, Napoli, Italy;NeuRoNe Lab, DMI, Fisciano, SA, Italy;Istituto per le Applicazioni del Calcolo 'Mauro Picone', CNR, Napoli, Italy;NeuRoNe Lab, DMI, Fisciano, SA, Italy;NeuRoNe Lab, DMI, Fisciano, SA, Italy

  • Venue:
  • CIBB'09 Proceedings of the 6th international conference on Computational intelligence methods for bioinformatics and biostatistics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering is one of the most important unsupervised learning problems and it deals with finding a structure in a collection of unlabeled data; however, different clustering algorithms applied to the same data-set produce different solutions. In many applications the problem of multiple solutions becomes crucial and providing a limited group of good clusterings is often more desirable than a single solution. In this work we propose the Least Square Consensus clustering that allows a user to extrapolate a small number of different clustering solutions from an initial (large) set of solutions obtained by applying any clustering algorithm to a given data-set. Two different implementations are presented. In both cases, each consensus is accomplished with a measure of quality defined in terms of Least Square error and a graphical visualization is provided in order to make immediately interpretable the result. Numerical experiments are carried out on both synthetic and real data-sets.