A simulation study comparing weighted estimating equations with multiple imputation based estimating equations for longitudinal binary data

  • Authors:
  • Caroline Beunckens;Cristina Sotto;Geert Molenberghs

  • Affiliations:
  • Center for Statistics, Hasselt University, Agoralaan 1, Building D, 3590 Diepenbeek, Belgium;Center for Statistics, Hasselt University, Agoralaan 1, Building D, 3590 Diepenbeek, Belgium and School of Statistics, University of the Philippines, Diliman, Quezon City, Philippines;Center for Statistics, Hasselt University, Agoralaan 1, Building D, 3590 Diepenbeek, Belgium

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2008

Quantified Score

Hi-index 0.03

Visualization

Abstract

Missingness frequently complicates the analysis of longitudinal data. A popular solution for dealing with incomplete longitudinal data is the use of likelihood-based methods, when, for example, linear, generalized linear, or non-linear mixed models are considered, due to their validity under the assumption of missing at random (MAR). Semi-parametric methods such as generalized estimating equations (GEEs) offer another attractive approach but require the assumption of missing completely at random (MCAR). Weighted GEE (WGEE) has been proposed as an elegant way to ensure validity under MAR. Alternatively, multiple imputation (MI) can be used to pre-process incomplete data, after which GEE is applied (MI-GEE). Focusing on incomplete binary repeated measures, both methods are compared using the so-called asymptotic, as well as small-sample, simulations, in a variety of correctly specified as well as incorrectly specified models. In spite of the asymptotic unbiasedness of WGEE, results provide striking evidence that MI-GEE is both less biased and more accurate in the small to moderate sample sizes which typically arise in clinical trials.