Classification Strategies Using Certain and Possible Rules

Authors:
Jerzy W. Grzymala-Busse;Xihong Zou
Affiliations:
-;-
Venue:
RSCTC '98 Proceedings of the First International Conference on Rough Sets and Current Trends in Computing
Year:
1998

Citing 4
Cited 3

Induction: processes of inference, learning, and discovery

Induction: processes of inference, learning, and discovery
Classifier systems and genetic algorithms

Machine learning: paradigms and methods
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems

Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Rough Sets: Theoretical Aspects of Reasoning about Data

Rough Sets: Theoretical Aspects of Reasoning about Data

Rough Set Analysis of Preference-Ordered Data

TSCTC '02 Proceedings of the Third International Conference on Rough Sets and Current Trends in Computing
Handling missing attribute values in preterm birth data sets

RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part II
Incremental versus non-incremental rule induction for multicriteria classification

Transactions on Rough Sets II

Quantified Score

Hi-index	0.00

Visualization

Abstract

A typical real-life data set is affected by inconsistencies-- cases characterized by the same attribute values are classified as members of different concepts. The most apparent methodology to handle inconsistencies is offered by rough set theory. For every concept two sets are computed: the lower approximation and the upper approximation. From these two sets a rule induction system induces two rule sets: certain and possible. The problem is how to use these two sets in the process of classification of new, unseen cases. For example, should we use only certain rules (or only possible rules) for classification? Should certain rules be used first and, when a case does not match any certain rule, should possible rules be used later? How to combine certain and possible rules with complete and partial matching of rules by a case? This paper presents experiments that were done to answer these questions. Different strategies were compared by classifying ten real-life data sets, using the error rate as a criterion of quality.