Imputation through finite Gaussian mixture models

  • Authors:
  • Marco Di Zio;Ugo Guarnera;Orietta Luzi

  • Affiliations:
  • Istituto Nazionale di Statistica, via Cesare Balbo 16, 00184 Roma, Italy;Istituto Nazionale di Statistica, via Cesare Balbo 16, 00184 Roma, Italy;Istituto Nazionale di Statistica, via Cesare Balbo 16, 00184 Roma, Italy

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2007

Quantified Score

Hi-index 0.03

Visualization

Abstract

Imputation is a widely used method for handling missing data. It consists in the replacement of missing values with plausible ones. Parametric and nonparametric techniques are generally adopted for modelling incomplete data. Both of them have advantages and drawbacks. Parametric techniques are parsimonious but depend on the model assumed, while nonparametric techniques are more flexible but require a high amount of observations. The use of finite mixture of multivariate Gaussian distributions for handling missing data is proposed. The main reason is that it allows to control the trade-off between parsimony and flexibility. An experimental comparison with the widely used imputation nearest neighbour donor is illustrated.