Dimension reduction for model-based clustering via mixtures of multivariate $$t$$t-distributions

  • Authors:
  • Katherine Morris;Paul D. Mcnicholas;Luca Scrucca

  • Affiliations:
  • Department of Mathematics and Statistics, University of Guelph, Ontario, Canada;Department of Mathematics and Statistics, University of Guelph, Ontario, Canada;Dipartimento di Economia, Finanza e Statistica, Università degli Studi di Perugia, Perugia, Italy

  • Venue:
  • Advances in Data Analysis and Classification
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a dimension reduction method for model-based clustering obtained from a finite mixture of $$t$$t-distributions. This approach is based on existing work on reducing dimensionality in the case of finite Gaussian mixtures. The method relies on identifying a reduced subspace of the data by considering the extent to which group means and group covariances vary. This subspace contains linear combinations of the original data, which are ordered by importance via the associated eigenvalues. Observations can be projected onto the subspace and the resulting set of variables captures most of the clustering structure available in the data. The approach is illustrated using simulated and real data, where it outperforms its Gaussian analogue.