A combined overdispersed and marginalized multilevel model

  • Authors:
  • Samuel Iddi;Geert Molenberghs

  • Affiliations:
  • I-BioStat, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium;I-BioStat, Universiteit Hasselt, Agoralaan 1, B-3590 Diepenbeek, Belgium and I-BioStat, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2012

Quantified Score

Hi-index 0.03

Visualization

Abstract

Overdispersion and correlation are two features often encountered when modeling non-Gaussian dependent data, usually as a function of known covariates. Methods that ignore the presence of these phenomena are often in jeopardy of leading to biased assessment of covariate effects. The beta-binomial and negative binomial models are well known in dealing with overdispersed data for binary and count data, respectively. Similarly, generalized estimating equations (GEE) and the generalized linear mixed models (GLMM) are popular choices when analyzing correlated data. A so-called combined model simultaneously acknowledges the presence of dependency and overdispersion by way of two separate sets of random effects. A marginally specified logistic-normal model for longitudinal binary data which combines the strength of the marginal and hierarchical models has been previously proposed. These two are brought together to produce a marginalized longitudinal model which brings together the comfort of marginally meaningful parameters and the ease of allowing for overdispersion and correlation. Apart from model formulation, estimation methods are discussed. The proposed model is applied to two clinical studies and compared to the existing approach. It turns out that by explicitly allowing for overdispersion random effect, the model significantly improves.