Nonparametric Bayesian mixture models, in particular Dirichlet process (DP) mixture models, have shown great promise for density estimation and data clustering. Given the size of today's datasets, computational efficiency is essential if these techniques are to be applicable to real-world data. We study and experimentally compare a number of variational Bayesian (VB) approximations to the DP mixture model. In particular, we consider the standard VB approximation, in which parameters are assumed to be independent of the cluster assignment variables, and a novel collapsed VB approximation, in which the mixture weights are marginalized out. For both VB approximations we consider two ways to approximate the DP: truncating the stick-breaking construction, and using a finite mixture model with a symmetric Dirichlet prior.
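
To make the two DP approximations concrete, the following is a minimal NumPy sketch, not code from the paper: the function names, the truncation level T, and the concentration parameter alpha are illustrative choices. It draws mixture weights either from a truncated stick-breaking construction (the final stick is forced to absorb all remaining mass) or from a finite symmetric Dirichlet prior with parameter alpha/K, which approaches a DP(alpha) as K grows.

import numpy as np

def stick_breaking_weights(alpha, T, seed=None):
    # Truncated stick-breaking: v_t ~ Beta(1, alpha) for t < T, v_T = 1,
    # and pi_t = v_t * prod_{s<t} (1 - v_s), so the weights sum to one.
    rng = np.random.default_rng(seed)
    v = rng.beta(1.0, alpha, size=T)
    v[-1] = 1.0  # truncation: the last stick takes all remaining mass
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - v[:-1])))
    return v * remaining

def symmetric_dirichlet_weights(alpha, K, seed=None):
    # Finite-mixture alternative: pi ~ Dirichlet(alpha/K, ..., alpha/K).
    rng = np.random.default_rng(seed)
    return rng.dirichlet(np.full(K, alpha / K))

# Both yield a valid weight vector over (at most) 20 components.
pi_sb = stick_breaking_weights(alpha=1.0, T=20, seed=0)
pi_fd = symmetric_dirichlet_weights(alpha=1.0, K=20, seed=0)
assert np.isclose(pi_sb.sum(), 1.0) and np.isclose(pi_fd.sum(), 1.0)

Either weight model can then be combined with per-cluster parameter priors and optimized with the standard or collapsed VB updates discussed above.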