Mixed-membership naive Bayes models

Authors:
Hanhuai Shan;Arindam Banerjee
Affiliations:
Department of Computer Science & Engineering, University of Minnesota, Twin Cities, Minneapolis, USA;Department of Computer Science & Engineering, University of Minnesota, Twin Cities, Minneapolis, USA
Venue:
Data Mining and Knowledge Discovery
Year:
2011

Citing 22
Cited 0

On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
A view of the EM algorithm that justifies incremental, sparse, and other variants

Learning in graphical models
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Modeling annotated data

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Latent dirichlet allocation

The Journal of Machine Learning Research
Information-theoretic co-clustering

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Learning to Decode Cognitive States from Brain Images

Machine Learning
An information theoretic analysis of maximum likelihood mixture estimation for exponential families

ICML '04 Proceedings of the twenty-first international conference on Machine learning
A Bayesian Hierarchical Model for Learning Natural Scene Categories

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Model-based overlapping clustering

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Clustering on the Unit Hypersphere using von Mises-Fisher Distributions

The Journal of Machine Learning Research
A latent variable model for chemogenomic profiling

Bioinformatics
Dynamic topic models

ICML '06 Proceedings of the 23rd international conference on Machine learning
Clustering with Bregman Divergences

The Journal of Machine Learning Research
Naïve Bayes for microRNA target predictions—machine learning for microRNA targets

Bioinformatics
Statistical models for partial membership

Proceedings of the 25th international conference on Machine learning
Fast collapsed gibbs sampling for latent dirichlet allocation

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Mixed Membership Stochastic Blockmodels

The Journal of Machine Learning Research
Bayesian Co-clustering

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
A Generative Probabilistic Model for Multi-label Classification

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Multiplicative Mixture Models for Overlapping Clustering

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years, mixture models have found widespread usage in discovering latent cluster structure from data. A popular special case of finite mixture models is the family of naive Bayes (NB) models, where the probability of a feature vector factorizes over the features for any given component of the mixture. Despite their popularity, naive Bayes models do not allow data points to belong to different component clusters with varying degrees, i.e., mixed memberships, which puts a restriction on their modeling ability. In this paper, we propose mixed-membership naive Bayes (MMNB) models. On one hand, MMNB can be viewed as a generalization of NB by putting a Dirichlet prior on top to allow mixed memberships. On the other hand, MMNB can also be viewed as a generalization of latent Dirichlet allocation (LDA) with the ability to handle heterogeneous feature vectors with different types of features, e.g., real, categorical, etc.. We propose two variational inference algorithms to learn MMNB models. The first one is based on ideas originally used in LDA, and the second one uses substantially fewer variational parameters, leading to a significantly faster algorithm. Further, we extend MMNB/LDA to discriminative mixed-membership models for classification by suitably combining MMNB/LDA with multi-class logistic regression. The efficacy of the proposed mixed-membership models is demonstrated by extensive experiments on several datasets, including UCI benchmarks, recommendation systems, and text datasets.