Hierarchical models for repeated binary data using the IBF sampler

  • Authors:
  • Ming Tan;Guo-Liang Tian;Kai Wang Ng

  • Affiliations:
  • Division of Biostatistics, Greenebaum Cancer Center, University of Maryland, 22 South Greene Street, Baltimore, MD 21201, USA;Division of Biostatistics, Greenebaum Cancer Center, University of Maryland, 22 South Greene Street, Baltimore, MD 21201, USA;Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2006

Quantified Score

Hi-index 0.03

Visualization

Abstract

Hierarchical models have emerged as a promising tool for the analysis of repeated binary data. However, the computational complexity in these models have limited their applications in practice. Several approaches have been proposed in the literature to overcome the computational difficulties including maximum likelihood estimation from a frequentist perspective (e.g., J. Amer. Statist. Assoc. 89 (1994) 330-335) and Markov chain Monte Carlo (MCMC) methods from a Bayesian perspective (e.g., Generalized Linear Models: A Bayesian Perspective, Marcel Dekker, New York, pp. 113-131). Although MCMC methods provide the whole posterior of the parameter of interest, the convergence diagnostics problem of the Markov chain and the slow convergence problem owing to the introduction of too many Gaussian latent variables are still unresolved. Recently, Tan et al. (Statist. Sinica 13 (2003) 625-639) proposed a noniterative sampling approach, the inverse Bayes formulae (IBF) sampler, for computing posteriors in the structure of EM algorithm. This article develops the IBF sampler in the structure of Monte Carlo EM (MCEM) for the hierarchical model with repeated binary data for which current methods encounter difficulty. An efficient IBF sampler is implemented by utilizing the estimated posterior modes obtained via MCEM algorithm. The proposed method generates independent and identically distributed (iid) samples approximately from the observed posterior distribution and thus alleviates the convergence problem associated with the MCMC methods. In addition, the slow convergence problem in Gibbs sampler can be bypassed in the noniterative IBF sampler via running some fast EM-type algorithm. Real datasets from six cities children's wheeze study and children's ear fluid study illustrate the proposed methods.