Clustering student skill set profiles in a unit hypercube using mixtures of multivariate betas

Authors:
Nema Dean;Rebecca Nugent
Affiliations:
School of Mathematics and Statistics, University of Glasgow, Glasgow, UK G12 8QQ;Department of Statistics, Carnegie Mellon University, Pittsburgh, USA 15213
Venue:
Advances in Data Analysis and Classification
Year:
2013

Citing 3
Cited 0

Applications of beta-mixture models in bioinformatics

Bioinformatics
Variable selection in model-based clustering: A general variable role modeling

Computational Statistics & Data Analysis
Addressing the assessment challenge with an online system that tutors as it assesses

User Modeling and User-Adapted Interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a finite mixture of multivariate betas as a new model-based clustering method tailored to applications where the feature space is constrained to the unit hypercube. The mixture component densities are taken to be conditionally independent, univariate unimodal beta densities (from the subclass of reparameterized beta densities given by Bagnato and Punzo in Comput Stat 28(4):10.1007/s00180-012-367-4, 2013). The EM algorithm used to fit this mixture is discussed in detail, and results from both this beta mixture model and the more standard Gaussian model-based clustering are presented for simulated skill mastery data from a common cognitive diagnosis model and for real data from the Assistment System online mathematics tutor (Feng et al. in J User Model User Adap Inter 19(3):243---266, 2009). The multivariate beta mixture appears to outperform the standard Gaussian model-based clustering approach, as would be expected on the constrained space. Fewer components are selected (by BIC-ICL) in the beta mixture than in the Gaussian mixture, and the resulting clusters seem more reasonable and interpretable.