Handling incomplete categorical data for supervised learning

  • Authors:
  • Been-Chian Chien;Cheng-Feng Lu;Steen J. Hsu

  • Affiliations:
  • Department of Computer Science and Information Engineering, National University of Tainan, Tainan, Taiwan, R.O.C.;Department of Information Engineering, I-Shou University, Kaohsiung, Taiwan, R.O.C.;Department of Information Management, Ming Hsin University of Science and Technology, Hsin-Chu, Taiwan, R.O.C.

  • Venue:
  • IEA/AIE'06 Proceedings of the 19th international conference on Advances in Applied Artificial Intelligence: industrial, Engineering and Other Applications of Applied Intelligent Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Classification is an important research topic in knowledge discovery. Most of the researches on classification concern that a complete dataset is given as a training dataset and the test data contain all values of attributes without missing. Unfortunately, incomplete data usually exist in real-world applications. In this paper, we propose new handling schemes of learning classification models from incomplete categorical data. Three methods based on rough set theory are developed and discussed for handling incomplete training data. The experiments were made and the results were compared with previous methods making use of a few famous classification models to evaluate the performance of the proposed handling schemes.