A hierarchical mixture model for clustering three-way data sets

Authors:
Jeroen K. Vermunt
Affiliations:
Department of Methodology and Statistics, Tilburg University, P.O. Box 90153, 5000 LE Tilburg, The Netherlands
Venue:
Computational Statistics & Data Analysis
Year:
2007

Citing 4
Cited 6

Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
Hierarchical mixtures of experts and the EM algorithm

Neural Computation
Latent class models for classification

Computational Statistics & Data Analysis
Hierarchical Latent Class Models for Cluster Analysis

The Journal of Machine Learning Research

Editorial: Advances in Mixture Models

Computational Statistics & Data Analysis
A dimensionally reduced finite mixture model for multilevel data

Journal of Multivariate Analysis
Finite mixtures of matrix normal distributions for classifying three-way data

Statistics and Computing
Model based clustering of customer choice data

Computational Statistics & Data Analysis
A hierarchical modeling approach for clustering probability density functions

Computational Statistics & Data Analysis
Constrained Multilevel Latent Class Models for the Analysis of Three-Way Three-Mode Binary Data

Journal of Classification

Quantified Score

Hi-index	0.03

Visualization

Abstract

Three-way data sets occur when various attributes are measured for a set of observational units in different situations. Examples are genotype by environment by attribute data obtained in a plant experiment, individual by time point by response data in a longitudinal study, and individual by brand by attribute data in a market research survey. Clustering observational units (genotypes/individuals) by means of a special type of the normal mixture model has been proposed. An implicit assumption of this approach is, however, that observational units are in the same cluster in all situations. An extension is presented that makes it possible to relax this assumption and that because of this may yield much simpler clustering solutions. The proposed extension-which includes the earlier model as a special case-is obtained by adapting the multilevel latent class model for categorical responses to the three-way situation, as well as to the situation in which responses include continuous variables. An efficient EM algorithm for parameter estimation by maximum likelihood is described and two empirical examples are provided.