Rule quality for multiple-rule classifier: Empirical expertise and theoretical methodology

Authors:
Ivan Bruha;Josef Tkadlec
Affiliations:
Department Computing & Software, McMaster University, Hamilton, Ont., L8S4K1 Canada. E-mail: bruha@mcmaster.ca;Department of Mathematics, Faculty of Electrical Engineering, Czech Technical University, 16627 Praha, Czech Republic. E-mail: tkadlec@fel.cvut.cz
Venue:
Intelligent Data Analysis
Year:
2003

Citing 12
Cited 4

A theory of the learnable

Communications of the ACM
Building expert systems

Building expert systems
Computational limitations on learning from examples

Journal of the ACM (JACM)
Information-Based Evaluation Criterion for Classifier's Performance

Machine Learning
Rule induction with CN2: some recent improvements

EWSL-91 Proceedings of the European working session on learning on Machine learning
Learning Two-Tiered Descriptions of Flexible Concepts: The POSEIDON System

Machine Learning
C4.5: programs for machine learning

C4.5: programs for machine learning
The CN2 Induction Algorithm

Machine Learning
An Empirical Comparison of Selection Measures for Decision-Tree Induction

Machine Learning
Induction of Decision Trees

Machine Learning
Controlled Redundancy in Incremental Rule Learning

ECML '93 Proceedings of the European Conference on Machine Learning
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

A critical review of multi-objective optimization in data mining: a position paper

ACM SIGKDD Explorations Newsletter
Raising data for improved support in rule mining: How to raise and how far to raise

Intelligent Data Analysis
Rule quality measure-based induction of unordered sets of regression rules

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
Redefinition of Decision Rules Based on the Importance of Elementary Conditions Evaluation

Fundamenta Informaticae

Quantified Score

Hi-index	0.00

Visualization

Abstract

A rule-inducing learning algorithm may yield either an ordered or unordered set of decision rules. The latter seems to be more understandable by humans and directly applicable in most expert systems or decision-supporting ones. However, classification utilizing the unordered-mode decision rules may be accompanied by some conflict situations, particularly when several rules belonging to different classes match ('fire' for) an input to-be-classified (unseen) object. One of the possible solutions to this conflict is to associate each decision rule induced by a learning algorithm with a numerical factor which is commonly called the rule quality. The paper first surveys empirical and statistical formulas of the rule quality and compares their characteristics. Statistical tools such as contingency tables, measures of association, measures of agreement are introduced as suitable vehicles for depicting a behaviour of a decision rule. The above formulas as well as schemes for their combinations are experimentally tested on several well-known AI databases and compared. The covering learning algorithm CN4, a large extension of CN2, is used as an inductive vehicle. After that, theoretical methodology for defining rule qualities and schemes for their combination is acquainted. The general definitions of the notions of a Designer, Learner, and Classifier are presented in a formal matter, including parameters that are usually attached to these concepts such as rule consistency, completeness, quality, matching rate, etc. Hence, we provide the minimum-requirement definitions as necessary conditions for the above concepts. Any designer (decision-system builder) of a new multiple-rule system may start with these minimum requirements. We conclude with a general flow chart for a decision-system builder. He/she can just pursue it and select parameters of a Learner and Classifier, following the minimum characteristics provided.