Sequential imputation for missing values

Authors:
Sabine Verboven; Karlien Vanden Branden; Peter Goos
Affiliations:
University of Antwerp, Department of Mathematics, Statistics & Actuarial Sciences, Prinsstraat 13, 2000 Antwerp, Belgium; Joint Research Centre, TP 361, 21020 Ispra, VA, Italy; University of Antwerp, Department of Mathematics, Statistics & Actuarial Sciences, Prinsstraat 13, 2000 Antwerp, Belgium
Venue:
Computational Biology and Chemistry
Year:
2007


Abstract

Missing values are common in gene expression data, and many imputation methods have been developed to replace them with estimated values. The available techniques nevertheless have disadvantages. Some restrict the imputation of missing values to a limited set of genes, others optimise a more global criterion at a computation time that becomes infeasible, and still others are fast but inaccurate. This paper therefore proposes a new, fast and accurate estimation procedure called SEQimpute. Because it minimises a statistical distance rather than a Euclidean distance, the method is intrinsically different from existing imputation methods. Moreover, it can easily be embedded in a multiple imputation technique, which is better suited to highlighting the uncertainty in the missing value estimates. A comparative study assesses how well different imputation approaches estimate the missing values. The proposed method is shown to outperform some of the existing imputation methods in terms of accuracy and computation speed.
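The idea of imputing by minimising a statistical distance can be illustrated with a minimal sketch. It is not the authors' SEQimpute implementation. It assumes that the statistical distance is the Mahalanobis distance to the mean and covariance of the rows completed so far, and that incomplete rows are processed one at a time in their original order. The function names `impute_row_mahalanobis` and `sequential_impute` are hypothetical. Under these assumptions the minimiser has a closed form: the conditional mean of the missing entries given the observed ones.

```python
import numpy as np


def impute_row_mahalanobis(x, mean, cov):
    """Fill the NaNs in x with the values that minimise the Mahalanobis
    distance of the completed row to (mean, cov).

    Assumption (not from the abstract): the statistical distance is the
    Mahalanobis distance. Its minimiser over the missing entries is
    mu_m + S_mo S_oo^{-1} (x_o - mu_o).
    """
    x = np.asarray(x, dtype=float).copy()
    miss = np.isnan(x)
    if not miss.any():
        return x
    obs = ~miss
    s_oo = cov[np.ix_(obs, obs)]    # covariance among observed entries
    s_mo = cov[np.ix_(miss, obs)]   # covariance between missing and observed
    x[miss] = mean[miss] + s_mo @ np.linalg.solve(s_oo, x[obs] - mean[obs])
    return x


def sequential_impute(data):
    """Impute incomplete rows one at a time (illustrative sketch).

    Starts from the complete rows, imputes each incomplete row, then adds
    the completed row to the reference set before moving to the next one.
    Needs enough complete rows for the observed-block covariance to be
    invertible.
    """
    data = np.asarray(data, dtype=float)
    complete_mask = ~np.isnan(data).any(axis=1)
    completed = data[complete_mask]
    result = data.copy()
    for i in np.flatnonzero(~complete_mask):
        mean = completed.mean(axis=0)
        cov = np.cov(completed, rowvar=False)
        result[i] = impute_row_mahalanobis(data[i], mean, cov)
        completed = np.vstack([completed, result[i]])
    return result
```

For a Euclidean criterion the same minimisation would simply return the column means of the missing entries. The covariance term is what lets the observed entries of a row inform its imputed entries.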