WIMP: Web server tool for missing data imputation

Authors:
D. Urda;J. L. Subirats;P. J. García-Laencina;L. Franco;J. L. Sancho-Gómez;J. M. Jerez
Affiliations:
Departamento de Lenguajes y Ciencias de la Computación, ETSI Informática, University of Málaga, Spain;Departamento de Lenguajes y Ciencias de la Computación, ETSI Informática, University of Málaga, Spain;Centro Universitario de la Defensa de San Javier, MDE-UPCT, Spain;Departamento de Lenguajes y Ciencias de la Computación, ETSI Informática, University of Málaga, Spain;Departamento de Tecnologías de la Información y las Comunicaciones, Universidad Politécnica de Cartagena, Spain;Departamento de Lenguajes y Ciencias de la Computación, ETSI Informática, University of Málaga, Spain
Venue:
Computer Methods and Programs in Biomedicine
Year:
2012

Citing 16
Cited 0

Statistical analysis with missing data

Statistical analysis with missing data
C4.5: programs for machine learning

C4.5: programs for machine learning
Missing value estimation for DNA microarray gene expression data: local least squares imputation

Bioinformatics
Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data

Bioinformatics
The influence of missing value imputation on detection of differentially expressed genes from microarray data

Bioinformatics
On Classification with Incomplete Data

IEEE Transactions on Pattern Analysis and Machine Intelligence
Exploiting missing clinical data in Bayesian network modeling for predicting medical problems

Journal of Biomedical Informatics
Confidence intervals for marginal parameters under fractional linear regression imputation for missing data

Journal of Multivariate Analysis
Ameliorative missing value imputation for robust biological knowledge inference

Journal of Biomedical Informatics
Sequential local least squares imputation estimating missing value of microarray data

Computers in Biology and Medicine
K nearest neighbours with mutual information for simultaneous classification and missing data imputation

Neurocomputing
Autoregressive-model-based missing value estimation for DNA microarray time series data

IEEE Transactions on Information Technology in Biomedicine
Partial identification with missing data: concepts and findings

International Journal of Approximate Reasoning
Pattern classification with missing data: a review

Neural Computing and Applications - Special Issue - KES2008
Predicting incomplete gene microarray data with the use of supervised learning algorithms

Pattern Recognition Letters
Missing data imputation using statistical and machine learning methods in a real breast cancer problem

Artificial Intelligence in Medicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

The imputation of unknown or missing data is a crucial task on the analysis of biomedical datasets. There are several situations where it is necessary to classify or identify instances given incomplete vectors, and the existence of missing values can much degrade the performance of the algorithms used for the classification/recognition. The task of learning accurately from incomplete data raises a number of issues some of which have not been completely solved in machine learning applications. In this sense, effective missing value estimation methods are required. Different methods for missing data imputations exist but most of the times the selection of the appropriate technique involves testing several methods, comparing them and choosing the right one. Furthermore, applying these methods, in most cases, is not straightforward, as they involve several technical details, and in particular in cases such as when dealing with microarray datasets, the application of the methods requires huge computational resources. As far as we know, there is not a public software application that can provide the computing capabilities required for carrying the task of data imputation. This paper presents a new public tool for missing data imputation that is attached to a computer cluster in order to execute high computational tasks. The software WIMP (Web IMPutation) is a public available web site where registered users can create, execute, analyze and store their simulations related to missing data imputation.