Incorporating Nonlinear Relationships in Microarray Missing Value Imputation

Authors:
Tianwei Yu; Hesen Peng; Wei Sun
Affiliations:
Emory University, Atlanta; Emory University, Atlanta; University of North Carolina, Chapel Hill
Venue:
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Year:
2011

Citing 0
Cited 3

Locally linear reconstruction based missing value imputation for supervised learning
Neurocomputing

Customized prediction of respiratory motion with clustering from multiple patient interaction
ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers

Hierarchical Clustering of High-Throughput Expression Data Based on General Dependences
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Abstract

Microarray gene expression data often contain missing values. Accurate estimation of the missing values is important for downstream data analyses that require complete data. Nonlinear relationships between gene expression levels have not been well utilized in missing value imputation. We propose an imputation scheme based on nonlinear dependencies between genes. Using simulations based on real microarray data, we show that incorporating nonlinear relationships could improve the accuracy of missing value imputation, both in terms of normalized root-mean-squared error and in terms of preserving the list of significant genes in statistical testing. In addition, we study the impact of artificial dependencies introduced by data normalization on the simulation results. Our results suggest that methods relying on global correlation structures may yield overly optimistic simulation results when the data have been subjected to row (gene)-wise mean removal.
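The simulation-based evaluation described in the abstract can be illustrated with a minimal sketch: hide a fraction of known entries, impute them, and score the result with a normalized root-mean-squared error. Everything below is an assumption for illustration only. The abstract does not give the exact normalization, so the sketch uses one common convention (RMSE over the hidden entries divided by the standard deviation of their true values). The row-mean imputer is a placeholder baseline, not the nonlinear scheme the authors propose, and the synthetic matrix, the 5% missing rate, and the function names `nrmse` and `row_mean_impute` are hypothetical.

```python
import numpy as np


def nrmse(truth, imputed, mask):
    """RMSE over the masked entries, divided by the standard deviation
    of the true values at those entries (an assumed normalization)."""
    diff = imputed[mask] - truth[mask]
    return float(np.sqrt(np.mean(diff ** 2)) / np.std(truth[mask]))


def row_mean_impute(data):
    """Placeholder baseline: fill each NaN with the mean of the observed
    values in the same row (gene). Not the method from the paper."""
    filled = data.copy()
    row_means = np.nanmean(data, axis=1)
    rows, cols = np.where(np.isnan(data))
    filled[rows, cols] = row_means[rows]
    return filled


# Synthetic stand-in for a complete genes-by-samples expression matrix.
rng = np.random.default_rng(0)
truth = rng.normal(size=(50, 20))

# Hide roughly 5% of the entries at random to simulate missing values.
mask = rng.random(truth.shape) < 0.05
observed = truth.copy()
observed[mask] = np.nan

imputed = row_mean_impute(observed)
score = nrmse(truth, imputed, mask)
print(f"NRMSE of the row-mean baseline: {score:.3f}")
```

Under this convention a perfect imputation scores 0, and lower values indicate more accurate imputation. A different imputer can be compared on the same `mask` by swapping out `row_mean_impute`.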