On the influence of imputation in classification: practical issues
Journal of Experimental & Theoretical Artificial Intelligence
Missing Data Analysis: A Kernel-Based Multi-Imputation Approach
Transactions on Computational Science III
Information enhancement for data mining
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Hi-index | 0.00 |
Data with missing attribute-values are quite common in many classification problems. In this paper, we incorporate an Expectation-Maximization(EM) inspired approach for filling up missing values to decision tree learning with the objective of improving classification accuracy. Here, each missing attribute-value is iteratively filled using a predictor constructed from the known values and predicted values of the missing attribute-values from the previous iteration. We show that our approach significantly outperforms some standard machine learning methods for handling missing values in classification tasks.