Missing Data Analysis: A Kernel-Based Multi-Imputation Approach

  • Authors:
  • Shichao Zhang;Zhi Jin;Xiaofeng Zhu;Jilian Zhang

  • Affiliations:
  • College of CS and IT, Guangxi Normal University, China and Faculty of EIT, UTS, Australia Broadway NSW 2007 and State Key Lab for Novel Software Technology, Nanjing University, PR China;School of EE and CS, Peking University, PR China;College of CS and IT, Guangxi Normal University, China;College of CS and IT, Guangxi Normal University, China

  • Venue:
  • Transactions on Computational Science III
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many missing data analysis techniques are of single-imputation. However, single-imputation cannot provide valid standard errors and confidence intervals, since it ignores the uncertainty implicit in the fact that the imputed values are not the actual values. Filling in each missing value with a set of plausible values is called multi-imputation. In this paper we propose a kernel-based stochastic non-parametric multi-imputation method under MAR (Missing at Random) and MCAR (Missing Completely at Random) missing mechanisms in nonparametric regression settings. Furthermore, we present a kernel-based stochastic semi-parametric multi-imputation method while we have some priori knowledge about the dataset with missing. Our algorithms are designed specifically with the aim of optimizing the confidence-interval and the relative efficiency. The proposed technique is evaluated by experimentations, using simulation data and real data, and the results demonstrate that our method performs much better than the NORM method, and is promising.