An Introduction to Variational Methods for Graphical Models
Machine Learning
Unsupervised Learning of Finite Mixture Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
The Journal of Machine Learning Research
Journal of Multivariate Analysis
Simultaneous Feature Selection and Clustering Using Mixture Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
Applications of beta-mixture models in bioinformatics
Bioinformatics
Bayesian Feature and Model Selection for Gaussian Mixture Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence
Simultaneous Localized Feature Selection and Model Detection for Gaussian Mixtures
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Hybrid Feature Extraction Selection Approach for High-Dimensional Non-Gaussian Data Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence
Bayesian Estimation of Beta Mixture Models with Variational Inference
IEEE Transactions on Pattern Analysis and Machine Intelligence
Unsupervised Anomaly Intrusion Detection via Localized Bayesian Feature Selection
ICDM '11 Proceedings of the 2011 IEEE 11th International Conference on Data Mining
IEEE Transactions on Image Processing
Hi-index | 0.00 |
In this paper, we propose a novel Bayesian nonparametric statistical approach of simultaneous clustering and localized feature selection for unsupervised learning. The proposed model is based on a mixture of Dirichlet processes with generalized Dirichlet (GD) distributions, which can also be seen as an infinite GD mixture model. Due to the nature of Bayesian nonparametric approach, the problems of overfitting and underfitting are prevented. Moreover, the determination of the number of clusters is sidestepped by assuming an infinite number of clusters. In our approach, the model parameters and the local feature saliency are estimated simultaneously by variational inference. We report experimental results of applying our model to two challenging clustering problems involving web pages and tissue samples which contain gene expressions.