Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

  • Authors: Charles Bouveyron; Camille Brunet
  • Affiliations: Laboratoire SAMM, EA 4543, Université Paris 1 Panthéon-Sorbonne, Paris, France 75013; IBISC, TADIB, FRE CNRS 3190, Université d'Evry Val d'Essonne, Evry Courcouronnes, France 91020
  • Venue: Statistics and Computing
  • Year: 2012

Abstract

Clustering in high-dimensional spaces is a recurrent problem in many scientific domains, but it remains difficult both in terms of clustering accuracy and in terms of interpreting the results. This paper presents a discriminative latent mixture (DLM) model which fits the data in a latent orthonormal discriminative subspace whose intrinsic dimension is lower than the dimension of the original space. By constraining model parameters within and between groups, a family of 12 parsimonious DLM models is obtained, which allows the approach to adapt to various situations. An estimation algorithm, called the Fisher-EM algorithm, is also proposed for estimating both the mixture parameters and the discriminative subspace. Experiments on simulated and real datasets highlight the good performance of the proposed approach compared to existing clustering methods, while also providing a useful representation of the clustered data. The method is also applied to the clustering of mass spectrometry data.
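
To give a rough idea of what a discriminative subspace estimated from soft cluster memberships can look like, the NumPy sketch below computes an orthonormal basis that maximises a classical Fisher criterion (between-cluster scatter relative to within-cluster scatter). This is only an illustration under stated assumptions and not the authors' Fisher-EM algorithm: the abstract does not specify how the subspace is estimated, so the generalized eigenproblem used here, the function name `fisher_subspace`, and the `ridge` regularisation parameter are all hypothetical choices.

```python
import numpy as np


def fisher_subspace(X, T, d, ridge=1e-6):
    """Return a (p, d) orthonormal basis maximising a soft Fisher criterion.

    X : (n, p) data matrix.
    T : (n, K) soft cluster memberships; each row is assumed to sum to 1.
    d : dimension of the subspace (at most K - 1 directions are informative).
    ridge : hypothetical regularisation keeping the within scatter invertible.
    """
    n, p = X.shape
    mean = X.mean(axis=0)
    Xc = X - mean

    nk = T.sum(axis=0)                    # soft cluster sizes, shape (K,)
    means = (T.T @ X) / nk[:, None]       # soft cluster means, shape (K, p)
    diff = means - mean

    S_b = (diff.T * nk) @ diff / n        # between-cluster scatter
    S_t = Xc.T @ Xc / n                   # total scatter
    S_w = S_t - S_b + ridge * np.eye(p)   # within-cluster scatter, regularised

    # Solve S_b v = lambda S_w v by whitening with the Cholesky factor of S_w.
    L = np.linalg.cholesky(S_w)           # S_w = L L^T
    A = np.linalg.solve(L, S_b)           # L^{-1} S_b
    M = np.linalg.solve(L, A.T)           # L^{-1} S_b L^{-T}
    M = (M + M.T) / 2                     # remove round-off asymmetry

    _, W = np.linalg.eigh(M)              # eigenvalues in ascending order
    W_top = W[:, ::-1][:, :d]             # d leading eigenvectors
    V = np.linalg.solve(L.T, W_top)       # map back: v = L^{-T} w

    U, _ = np.linalg.qr(V)                # orthonormalise the basis
    return U
```

In an EM-style loop, a step of this kind could sit between the computation of the memberships `T` and the update of the mixture parameters in the projected space `X @ U`; how the paper actually interleaves these steps is not stated in the abstract.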