Parameter setting for evolutionary latent class clustering

  • Authors:
  • Damien Tessier;Marc Schoenauer;Christophe Biernacki;Gilles Celeux;Gérard Govaert

  • Affiliations:
  • Projet TAO;Projet TAO;Laboratoire de Mathématiques Paul Painlevé, USTL, France;Heudiasyc, UTC, France;Projet SELECT, INRIA Futurs, France

  • Venue:
  • ISICA'07 Proceedings of the 2nd international conference on Advances in computation and intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

The latent class model or multivariate multinomial mixture is a powerful model for clustering discrete data. This model is expected to be useful to represent non-homogeneous populations. It uses a conditional independence assumption given the latent class to which a statistical unit is belonging. However, it leads to a criterion that proves difficult to optimise by the standard approach based on the EM algorithm. An Evolutionary Algorithms is designed to tackle this discrete optimisation problem, and an extensive parameter study on a large artificial dataset allows to derive stable parameters. Those parameters are then validated on other artificial datasets, as well as on some well-known real data: the Evolutionary Algorithm performs repeatedly better than other standard clustering techniques on the same data.