ENDER: a statistical framework for boosting decision rules

Authors:
Krzysztof Dembczyński;Wojciech Kotłowski;Roman Słowiński
Affiliations:
Poznań University of Technology, Poznań, Poland 60-965;Poznań University of Technology, Poznań, Poland 60-965;Poznań University of Technology, Poznań, Poland 60-965 and Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland 01-447
Venue:
Data Mining and Knowledge Discovery
Year:
2010

Citing 21
Cited 7

An introduction to computational learning theory

An introduction to computational learning theory
Bagging predictors

Machine Learning
Unifying instance-based and rule-based induction

Machine Learning
A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
Separate-and-Conquer Rule Learning

Artificial Intelligence Review
A simple, fast, and effective rule learner

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Improved Boosting Algorithms Using Confidence-rated Predictions

Machine Learning - The Eleventh Annual Conference on computational Learning Theory
An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization

Machine Learning
Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory

Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory
Rough Sets: Theoretical Aspects of Reasoning about Data

Rough Sets: Theoretical Aspects of Reasoning about Data
Knowledge Discovery and Measures of Interest

Knowledge Discovery and Measures of Interest
An Implementation of Logical Analysis of Data

IEEE Transactions on Knowledge and Data Engineering
The CN2 Induction Algorithm

Machine Learning
Lightweight Rule Induction

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Local Attribute Value Grouping for Lazy Rule Induction

TSCTC '02 Proceedings of the Third International Conference on Rough Sets and Current Trends in Computing
The set covering machine

The Journal of Machine Learning Research
RIONA: A New Classification System Combining Rule Induction and Instance-Based Learning

Fundamenta Informaticae
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Mining Pareto-optimal rules with respect to support and confirmation or support and anti-support

Engineering Applications of Artificial Intelligence
Margin-based first-order rule learning

Machine Learning
Maximum likelihood rule ensembles

Proceedings of the 25th international conference on Machine learning

Guest Editorial: Global modeling using local patterns

Data Mining and Knowledge Discovery
Dynamic Programming Approach for Partial Decision Rule Optimization

Fundamenta Informaticae - Concurrency Specification and Programming (CS&P)
Dynamic programming approach to optimization of approximate decision rules

Information Sciences: an International Journal
A niching genetic programming-based multi-objective algorithm for hybrid data classification

Neurocomputing
CHIRA---Convex Hull Based Iterative Algorithm of Rules Aggregation

Fundamenta Informaticae
Redefinition of Decision Rules Based on the Importance of Elementary Conditions Evaluation

Fundamenta Informaticae
Classifiers Based on Optimal Decision Rules

Fundamenta Informaticae - To Andrzej Skowron on His 70th Birthday

Quantified Score

Hi-index	0.00

Visualization

Abstract

Induction of decision rules plays an important role in machine learning. The main advantage of decision rules is their simplicity and human-interpretable form. Moreover, they are capable of modeling complex interactions between attributes. In this paper, we thoroughly analyze a learning algorithm, called ENDER, which constructs an ensemble of decision rules. This algorithm is tailored for regression and binary classification problems. It uses the boosting approach for learning, which can be treated as generalization of sequential covering. Each new rule is fitted by focusing on examples which were the hardest to classify correctly by the rules already present in the ensemble. We consider different loss functions and minimization techniques often encountered in the boosting framework. The minimization techniques are used to derive impurity measures which control construction of single decision rules. Properties of four different impurity measures are analyzed with respect to the trade-off between misclassification (discrimination) and coverage (completeness) of the rule. Moreover, we consider regularization consisting of shrinking and sampling. Finally, we compare the ENDER algorithm with other well-known decision rule learners such as SLIPPER, LRI and RuleFit.