Model-based clustering with non-elliptically contoured distributions

  • Authors:
  • Dimitris Karlis;Anais Santourian

  • Affiliations:
  • Department of Statistics, Athens University of Economics and Business, Athens, Greece 10434;Department of Statistics, Athens University of Economics and Business, Athens, Greece 10434

  • Venue:
  • Statistics and Computing
  • Year:
  • 2009

Abstract

Most of the existing literature on model-based clustering deals with symmetric components. In some cases, especially when dealing with skewed subpopulations, the estimated number of groups can be misleading: if symmetric components are assumed, more than one component is needed to describe a single asymmetric group. Existing mixture models, based on multivariate normal and multivariate t distributions, fit symmetric distributions, i.e. symmetric clusters. In the present paper, we propose the use of finite mixtures of the normal inverse Gaussian distribution (and its multivariate extensions). Such finite mixture models start from a density that allows for skewness and fat tails, generalize the existing models, are tractable and have desirable properties. We examine both the univariate case, to gain insight, and the multivariate case, which is more useful in real applications. EM-type algorithms are described for fitting the models. Real data examples are used to demonstrate the potential of the new model in comparison with existing ones.
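
To make the skewness point concrete, here is a minimal sketch of drawing from a univariate normal inverse Gaussian (NIG) distribution and from a finite mixture of NIG components. It is not the EM-type fitting procedure of the paper. It assumes the common (alpha, beta, mu, delta) parametrization, which the abstract does not state, and relies on the standard representation of the NIG as a normal variance-mean mixture with inverse Gaussian mixing. All function names are hypothetical and chosen for illustration only.

```python
import math
import random


def sample_inverse_gaussian(mean, shape, rng):
    """Draw one inverse Gaussian variate (Michael-Schucany-Haas method)."""
    y = rng.gauss(0.0, 1.0) ** 2
    root = math.sqrt(4.0 * mean * shape * y + (mean * y) ** 2)
    x = mean + (mean * mean * y) / (2.0 * shape) - (mean / (2.0 * shape)) * root
    # Accept the smaller root with probability mean / (mean + x),
    # otherwise return the conjugate root mean^2 / x.
    if rng.random() <= mean / (mean + x):
        return x
    return mean * mean / x


def sample_nig(alpha, beta, mu, delta, rng):
    """Draw one NIG variate; assumes alpha > |beta| and delta > 0."""
    gamma = math.sqrt(alpha * alpha - beta * beta)
    # Mixing variable: inverse Gaussian with mean delta/gamma and shape delta^2.
    z = sample_inverse_gaussian(delta / gamma, delta * delta, rng)
    # Conditionally on z, the draw is normal with mean mu + beta*z and variance z.
    return mu + beta * z + math.sqrt(z) * rng.gauss(0.0, 1.0)


def sample_nig_mixture(weights, components, n, rng):
    """Draw n values from a finite mixture of NIG components.

    components is a list of (alpha, beta, mu, delta) tuples.
    """
    draws = []
    for _ in range(n):
        alpha, beta, mu, delta = rng.choices(components, weights=weights)[0]
        draws.append(sample_nig(alpha, beta, mu, delta, rng))
    return draws
```

With `beta` different from zero a single component is already asymmetric, so one skewed group can be represented by one component rather than by several symmetric ones. For example, `sample_nig_mixture([0.6, 0.4], [(2.0, 1.0, 0.0, 1.0), (3.0, 0.0, 6.0, 1.0)], 1000, random.Random(1))` draws from a mixture of one right-skewed and one symmetric group.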