Incorporating an EM-Approach for Handling Missing Attribute-Values in Decision Tree Induction

  • Authors:
  • Amitava Karmaker

  • Affiliations:
  • University of Texas at San Antonio

  • Venue:
  • HIS '05 Proceedings of the Fifth International Conference on Hybrid Intelligent Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data with missing attribute-values are quite common in many classification problems. In this paper, we incorporate an Expectation-Maximization(EM) inspired approach for filling up missing values to decision tree learning with the objective of improving classification accuracy. Here, each missing attribute-value is iteratively filled using a predictor constructed from the known values and predicted values of the missing attribute-values from the previous iteration. We show that our approach significantly outperforms some standard machine learning methods for handling missing values in classification tasks.