Don't care values in induction

Authors:
Nikolaos Diamantidis;E. A. Giakoumakis
Affiliations:
-;-
Venue:
Artificial Intelligence in Medicine
Year:
1996

Citing 11
Cited 0

Concept learning and heuristic classification in weak-theory domains

Artificial Intelligence
Unknown attribute values in induction

Proceedings of the sixth international workshop on Machine learning
On changing continuous attributes into ordered discrete attributes

EWSL-91 Proceedings of the European working session on learning on Machine learning
C4.5: programs for machine learning

C4.5: programs for machine learning
Very Simple Classification Rules Perform Well on Most Commonly Used Datasets

Machine Learning
The CN2 Induction Algorithm

Machine Learning
Induction of Decision Trees

Machine Learning
Concept learning and the problem of small disjuncts

IJCAI'89 Proceedings of the 11th international joint conference on Artificial intelligence - Volume 1
Sparse data and the effect of overfitting avoidance in decision tree induction

AAAI'92 Proceedings of the tenth national conference on Artificial intelligence
Extraction of diagnostic rules using recursive partitioning systems: A comparison of two approaches

Artificial Intelligence in Medicine
Paper: Medical decision making based on inductive learning method

Artificial Intelligence in Medicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Inductive learning algorithms are powerful tools for the extraction of knowledge from data. Their success in medical domains is well-known. In medical diagnosis domains and generally in real-world applications among other problems, inductive learning algorithms have to deal with unknown values. In most cases unknown values are treated as missing ones. i.e. unknown values which are related to the class of training examples, but are missing due to lack of measurements. In this paper we address the problem of don't care values, which are unknown, because they are irrelevant to the class of the examples. The distinction of don't care values and missing ones is important in medical domains. With this distinction the experts are able to relate each diagnosis to the appropriate subset of attributes. We present techniques for dealing efficiently with don't care values in the induction of decision trees. Furthermore, we examine the importance of the distinction between missing and don't care values and we investigate the existence of don't care values instead of missing ones, in medical and non-medical real-world datasets.