Inference in model-based cluster analysis

Authors:
Halima Bensmail;Gilles Celeux;Adrian E. Raftery;Christian P. Robert
Affiliations:
Department of Statistics, University of Washington, Box 354322, Seattle, WA 98195-4322, USA;INRIA Rhône-Alpes, ZIRST, 655 Avenue de l'Europe, 38330 Montbonnot Saint-Martin, France;Department of Statistics, University of Washington, Box 354322, Seattle, WA 98195-4322, USA;CREST, INSEE, 3 Avenue Pierre Larousse, 92245 Malakoff Cedex, France
Venue:
Statistics and Computing
Year:
1997

Abstract

A new approach to cluster analysis has been introduced based on parsimonious geometric modelling of the within-group covariance matrices in a mixture of multivariate normal distributions, using hierarchical agglomeration and iterative relocation. It works well and is widely used via the MCLUST software available in S-PLUS and StatLib. However, it has several limitations: there is no assessment of the uncertainty about the classification, the partition can be suboptimal, parameter estimates are biased, the shape matrix has to be specified by the user, prior group probabilities are assumed to be equal, the method for choosing the number of groups is based on a crude approximation, and no formal way of choosing between the various possible models is included. Here, we propose a new approach which overcomes all these difficulties. It consists of exact Bayesian inference via Gibbs sampling, and the calculation of Bayes factors (for choosing the model and the number of groups) from the output using the Laplace–Metropolis estimator. It works well in several real and simulated examples.
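
To make the Gibbs sampling idea concrete, here is a minimal sketch in Python. It is not the authors' algorithm: it assumes a univariate mixture with a known, common variance, a normal prior on each group mean and a Dirichlet prior on the group probabilities, whereas the paper works with multivariate normals and parsimonious covariance structures. The function name `gibbs_mixture` and all of its parameters are hypothetical. The sketch only illustrates two points from the abstract: group probabilities need not be equal, and the sampler output gives a per-observation assessment of classification uncertainty. It does not compute Bayes factors.

```python
import numpy as np

def gibbs_mixture(x, k=2, n_iter=500, burn_in=100, sigma2=1.0,
                  m0=0.0, s0_sq=100.0, alpha=1.0, seed=0):
    """Gibbs sampler for a univariate normal mixture (illustrative only).

    Assumptions, not taken from the paper: known common variance sigma2,
    N(m0, s0_sq) prior on each mean, symmetric Dirichlet(alpha) prior on
    the group probabilities.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    n = x.size
    # Deterministic, spread-out starting means taken from data quantiles.
    mu = np.quantile(x, (np.arange(k) + 0.5) / k)
    p = np.full(k, 1.0 / k)
    mu_draws, p_draws = [], []
    membership = np.zeros((n, k))

    for it in range(n_iter):
        # Step 1: sample group labels from their conditional probabilities.
        log_w = np.log(p) - 0.5 * (x[:, None] - mu[None, :]) ** 2 / sigma2
        log_w -= log_w.max(axis=1, keepdims=True)  # numerical stability
        w = np.exp(log_w)
        w /= w.sum(axis=1, keepdims=True)
        u = rng.random(n)
        z = (w.cumsum(axis=1) < u[:, None]).sum(axis=1)
        z = np.minimum(z, k - 1)  # guard against rounding in the cumsum

        # Step 2: sample group probabilities given the label counts.
        counts = np.bincount(z, minlength=k)
        p = rng.dirichlet(alpha + counts)

        # Step 3: sample each group mean from its normal conditional.
        sums = np.bincount(z, weights=x, minlength=k)
        post_var = 1.0 / (1.0 / s0_sq + counts / sigma2)
        post_mean = post_var * (m0 / s0_sq + sums / sigma2)
        mu = rng.normal(post_mean, np.sqrt(post_var))

        if it >= burn_in:
            mu_draws.append(mu)
            p_draws.append(p)
            membership += w

    kept = n_iter - burn_in
    return {
        "mu": np.array(mu_draws),          # retained draws of the means
        "p": np.array(p_draws),            # retained draws of the probabilities
        "membership": membership / kept,   # averaged membership probabilities
    }
```

Each row of `membership` sums to one and can be read as an estimate of how confidently an observation is assigned to each group. That reading is only meaningful when the group labels do not switch during the run, which this sketch does not check.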