Robust estimation of mixture complexity for count data

Authors:
Mi-Ja Woo;T. N. Sriram
Affiliations:
Department of Statistics, University of Georgia, Athens, GA 30602-1952, USA;Department of Statistics, University of Georgia, Athens, GA 30602-1952, USA
Venue:
Computational Statistics & Data Analysis
Year:
2007

Citing 3
Cited 6

Minumum Hellinger distance estimation for Poisson mixtures

Computational Statistics & Data Analysis
Alternating kernel and mixture density estimates

Computational Statistics & Data Analysis
Mixtures of Factor Analyzers

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning

L2E estimation of mixture complexity for count data

Computational Statistics & Data Analysis
Minimum Hellinger distance estimation in a two-sample semiparametric model

Journal of Multivariate Analysis
The L1-consistency of Dirichlet mixtures in multivariate Bayesian density estimation

Journal of Multivariate Analysis
The robust estimation method for a finite mixture of Poisson mixed-effect models

Computational Statistics & Data Analysis
Efficient Hellinger distance estimates for semiparametric models

Journal of Multivariate Analysis
Minimum distance estimation in a finite mixture regression model

Journal of Multivariate Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

For finite mixtures, consistent estimation of unknown number of components, called mixture complexity, is considered based on a random sample of counts, when the exact form of component probability mass functions are unknown but are postulated to belong to some parametric family. Following a recent approach of Woo and Sriram [2006. Robust estimation of mixture complexity. J. Amer. Statist. Assoc., to appear.], we develop an estimator of mixture complexity as a by-product of minimizing a Hellinger information criterion, when all the parameters associated with the mixture model are unknown. The estimator is shown to be consistent. Monte Carlo simulations illustrate the ability of our estimator to correctly determine the mixture complexity when the postulated Poisson mixture model is correct. When the postulated model is a Poisson mixture but the data comes from a negative binomial mixture with moderate to more extreme overdispersion in one of its components, simulation results show that our estimator continues to perform well. These confirm the efficiency of the estimator when the model is correctly specified and the robustness when the model is incorrectly specified. A count dataset with overdispersion and possible zero inflation is analyzed to further illustrate the ability of our estimator to determine the number of components.