Tree induction vs. logistic regression: a learning-curve analysis

Authors:
Claudia Perlich;Foster Provost;Jeffrey S. Simonoff
Affiliations:
Leonard N. Stern School of Business, New York University, 44 West 4th Street, New York, NY;Leonard N. Stern School of Business, New York University, 44 West 4th Street, New York, NY;Leonard N. Stern School of Business, New York University, 44 West 4th Street, New York, NY
Venue:
The Journal of Machine Learning Research
Year:
2003

Citing 24
Cited 51

Symbolic and Neural Learning Algorithms: An Experimental Comparison

Machine Learning
Rule induction with CN2: some recent improvements

EWSL-91 Proceedings of the European working session on learning on Machine learning
C4.5: programs for machine learning

C4.5: programs for machine learning
Bagging predictors

Machine Learning
Rigorous learning curve bounds from statistical mechanics

Machine Learning - Special issue on COLT '94
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
Efficient progressive sampling

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms

Machine Learning
Robust Classification for Imprecise Environments

Machine Learning
SAS SQL Procedure User's Guide,Version 8

SAS SQL Procedure User's Guide,Version 8
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality

Data Mining and Knowledge Discovery
A Survey of Methods for Scaling Up Inductive Algorithms

Data Mining and Knowledge Discovery
Bayesian parameter estimation via variational methods

Statistics and Computing
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Pruning Decision Trees with Misclassification Costs

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
The Effects of Training Set Size on Decision Tree Complexity

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
The Case against Accuracy Estimation for Comparing Induction Algorithms

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Industry: telecommunications network diagnosis

Handbook of data mining and knowledge discovery
Tree Induction for Probability-Based Ranking

Machine Learning
Rule-based machine learning methods for functional prediction

Journal of Artificial Intelligence Research
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
The use of the area under the ROC curve in the evaluation of machine learning algorithms

Pattern Recognition
Learning Bayesian networks with local structure

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence

Distributed learning with bagging-like performance

Pattern Recognition Letters
Tree Induction for Probability-Based Ranking

Machine Learning
Active Sampling for Class Probability Estimation and Ranking

Machine Learning
Functional Trees

Machine Learning
Learning Ensembles from Bites: A Scalable and Accurate Approach

The Journal of Machine Learning Research
Optimising area under the ROC curve using gradient descent

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Model selection via the AUC

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Classification and knowledge discovery in protein databases

Journal of Biomedical Informatics - Special issue: Biomedical machine learning
Toward Intelligent Assistance for a Data Mining Process: An Ontology-Based Approach for Cost-Sensitive Classification

IEEE Transactions on Knowledge and Data Engineering
Logistic Model Trees

Machine Learning
ROC confidence bands: an empirical evaluation

ICML '05 Proceedings of the 22nd international conference on Machine learning
Generalized skewing for functions with continuous and nominal attributes

ICML '05 Proceedings of the 22nd international conference on Machine learning
Distribution-based aggregation for relational learning with identifier attributes

Machine Learning
An empirical comparison of supervised learning algorithms

ICML '06 Proceedings of the 23rd international conference on Machine learning
Quantitative pharmacophore models with inductive logic programming

Machine Learning
The feasibility of constructing a Predictive Outcome Model for breast cancer using the tools of data mining

Expert Systems with Applications: An International Journal
Classification in Networked Data: A Toolkit and a Univariate Case Study

The Journal of Machine Learning Research
Improving the performance of an incremental algorithm driven by error margins

Intelligent Data Analysis - Knowledge Discovery from Data Streams
Classifier Loss Under Metric Uncertainty

ECML '07 Proceedings of the 18th European conference on Machine Learning
Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Email Spam Filtering: A Systematic Review

Foundations and Trends in Information Retrieval
Comment on "On Discriminative vs. Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes"

Neural Processing Letters
Learning from the Past with Experiment Databases

PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Classification algorithm sensitivity to training data with non representative attribute noise

Decision Support Systems
Prediction of periventricular leukomalacia. Part II: Selection of hemodynamic features using computational intelligence

Artificial Intelligence in Medicine
Prediction of periventricular leukomalacia. Part I: Selection of hemodynamic features using logistic regression and decision tree algorithms

Artificial Intelligence in Medicine
A fast decision tree learning algorithm

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Comparative Analysis of Regression Tree Models for Premises Valuation Using Statistica Data Miner

ICCCI '09 Proceedings of the 1st International Conference on Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems
Evaluation of machine learning techniques for prostate cancer diagnosis and Gleason grading

International Journal of Computational Intelligence in Bioinformatics and Systems Biology
Bagging different instead of similar models for regression and classification problems

International Journal of Computer Applications in Technology
An Investigation of Missing Data Methods for Classification Trees Applied to Binary Response Data

The Journal of Machine Learning Research
An iterative process for building learning curves and predicting relative performance of classifiers

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Stepwise induction of logistic model trees

ISMIS'08 Proceedings of the 17th international conference on Foundations of intelligent systems
A dynamic classifier ensemble selection approach for noise data

Information Sciences: an International Journal
Soft Nearest Convex Hull Classifier

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Prediction in financial markets: The case for small disjuncts

ACM Transactions on Intelligent Systems and Technology (TIST)
Inactive learning?: difficulties employing active learning in practice

ACM SIGKDD Explorations Newsletter
Area under the ROC curve by bubble-sort approach (BSA)

ACMOS'05 Proceedings of the 7th WSEAS international conference on Automatic control, modeling and simulation
Estimating the effect of word of mouth on churn and cross-buying in the mobile phone market with Markov logic networks

Decision Support Systems
Tuning metaheuristics: A data mining based approach for particle swarm optimization

Expert Systems with Applications: An International Journal
Data-driven multi-touch attribution models

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Classification with support hyperplanes

ECML'06 Proceedings of the 17th European conference on Machine Learning
Facing the spammers: A very effective approach to avoid junk e-mails

Expert Systems with Applications: An International Journal
Evaluating model construction methods with objective rule evaluation indices to support human experts

MDAI'06 Proceedings of the Third international conference on Modeling Decisions for Artificial Intelligence
Experiment databases: a novel methodology for experimental research

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Two New Prediction-Driven Approaches to Discrete Choice Prediction

ACM Transactions on Management Information Systems (TMIS)
Customer event history for churn prediction: How long is long enough?

Expert Systems with Applications: An International Journal
Prediction of learning curves in machine translation

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Data science and prediction

Communications of the ACM
Bias-variance analysis in estimating true query model for information retrieval

Information Processing and Management: an International Journal
Quite a mess in my cookie jar!: leveraging machine learning to protect web authentication

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.04

Visualization

Abstract

Tree induction and logistic regression are two standard, off-the-shelf methods for building models for classification. We present a large-scale experimental comparison of logistic regression and tree induction, assessing classification accuracy and the quality of rankings based on class-membership probabilities. We use a learning-curve analysis to examine the relationship of these measures to the size of the training set. The results of the study show several things. (1) Contrary to some prior observations, logistic regression does not generally outperform tree induction. (2) More specifically, and not surprisingly, logistic regression is better for smaller training sets and tree induction for larger data sets. Importantly, this often holds for training sets drawn from the same domain (that is, the learning curves cross), so conclusions about induction-algorithm superiority on a given domain must be based on an analysis of the learning curves. (3) Contrary to conventional wisdom, tree induction is effective at producing probability-based rankings, although apparently comparatively less so for a given training-set size than at making classifications. Finally, (4) the domains on which tree induction and logistic regression are ultimately preferable can be characterized surprisingly well by a simple measure of the separability of signal from noise.