A bootstrap-based aggregate classifier for model-based clustering

  • Authors:
  • José G. Dias;Jeroen K. Vermunt

  • Affiliations:
  • Department of Quantitative Methods, ISCTE, Higher Institute of Social Sciences and Business Studies and UNIDE, Edifício ISCTE, Lisboa, Portugal 1649-026;Department of Methodology and Statistics, Tilburg University, Tilburg, The Netherlands 5000 LE

  • Venue:
  • Computational Statistics
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In model-based clustering, a situation in which true class labels are unknown and that is therefore also referred to as unsupervised learning, observations are typically classified by the Bayes modal rule. In this study, we assess whether alternative classifiers from the classification or supervised-learning literature--developed for situations in which class labels are known--can improve the Bayes rule. More specifically, we investigate the performance of bootstrap-based aggregate (bagging) rules after adapting these to the model-based clustering context. It is argued that specific issues, such as the label-switching problem, have to be carefully addressed when using bootstrap methods in model-based clustering. Our two Monte Carlo studies show that classification based on the Bayes rule is rather stable and difficult to improve by bootstrap-based aggregate rules, even for sparse data. An empirical example illustrates the various approaches described in this paper.