Symbolic and Neural Learning Algorithms: An Experimental Comparison
Machine Learning
Rule induction with CN2: some recent improvements
EWSL-91 Proceedings of the European working session on learning on Machine learning
C4.5: programs for machine learning
C4.5: programs for machine learning
Machine Learning
Rigorous learning curve bounds from statistical mechanics
Machine Learning - Special issue on COLT '94
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
Machine Learning - Special issue on learning with probabilistic representations
Efficient progressive sampling
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Robust Classification for Imprecise Environments
Machine Learning
SAS SQL Procedure User's Guide,Version 8
SAS SQL Procedure User's Guide,Version 8
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality
Data Mining and Knowledge Discovery
A Survey of Methods for Scaling Up Inductive Algorithms
Data Mining and Knowledge Discovery
Bayesian parameter estimation via variational methods
Statistics and Computing
Pruning Decision Trees with Misclassification Costs
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
The Effects of Training Set Size on Decision Tree Complexity
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
The Case against Accuracy Estimation for Comparing Induction Algorithms
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Industry: telecommunications network diagnosis
Handbook of data mining and knowledge discovery
Tree Induction for Probability-Based Ranking
Machine Learning
Rule-based machine learning methods for functional prediction
Journal of Artificial Intelligence Research
A study of cross-validation and bootstrap for accuracy estimation and model selection
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Learning Bayesian networks with local structure
UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence
Distributed learning with bagging-like performance
Pattern Recognition Letters
Tree Induction for Probability-Based Ranking
Machine Learning
Active Sampling for Class Probability Estimation and Ranking
Machine Learning
Machine Learning
Learning Ensembles from Bites: A Scalable and Accurate Approach
The Journal of Machine Learning Research
Optimising area under the ROC curve using gradient descent
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Classification and knowledge discovery in protein databases
Journal of Biomedical Informatics - Special issue: Biomedical machine learning
IEEE Transactions on Knowledge and Data Engineering
Machine Learning
ROC confidence bands: an empirical evaluation
ICML '05 Proceedings of the 22nd international conference on Machine learning
Generalized skewing for functions with continuous and nominal attributes
ICML '05 Proceedings of the 22nd international conference on Machine learning
An empirical comparison of supervised learning algorithms
ICML '06 Proceedings of the 23rd international conference on Machine learning
Quantitative pharmacophore models with inductive logic programming
Machine Learning
Expert Systems with Applications: An International Journal
Classification in Networked Data: A Toolkit and a Univariate Case Study
The Journal of Machine Learning Research
Improving the performance of an incremental algorithm driven by error margins
Intelligent Data Analysis - Knowledge Discovery from Data Streams
Classifier Loss Under Metric Uncertainty
ECML '07 Proceedings of the 18th European conference on Machine Learning
Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning
PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Email Spam Filtering: A Systematic Review
Foundations and Trends in Information Retrieval
Learning from the Past with Experiment Databases
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Classification algorithm sensitivity to training data with non representative attribute noise
Decision Support Systems
Artificial Intelligence in Medicine
Artificial Intelligence in Medicine
A fast decision tree learning algorithm
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Comparative Analysis of Regression Tree Models for Premises Valuation Using Statistica Data Miner
ICCCI '09 Proceedings of the 1st International Conference on Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems
Evaluation of machine learning techniques for prostate cancer diagnosis and Gleason grading
International Journal of Computational Intelligence in Bioinformatics and Systems Biology
Bagging different instead of similar models for regression and classification problems
International Journal of Computer Applications in Technology
An Investigation of Missing Data Methods for Classification Trees Applied to Binary Response Data
The Journal of Machine Learning Research
An iterative process for building learning curves and predicting relative performance of classifiers
EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Stepwise induction of logistic model trees
ISMIS'08 Proceedings of the 17th international conference on Foundations of intelligent systems
A dynamic classifier ensemble selection approach for noise data
Information Sciences: an International Journal
Soft Nearest Convex Hull Classifier
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Prediction in financial markets: The case for small disjuncts
ACM Transactions on Intelligent Systems and Technology (TIST)
Inactive learning?: difficulties employing active learning in practice
ACM SIGKDD Explorations Newsletter
Area under the ROC curve by bubble-sort approach (BSA)
ACMOS'05 Proceedings of the 7th WSEAS international conference on Automatic control, modeling and simulation
Tuning metaheuristics: A data mining based approach for particle swarm optimization
Expert Systems with Applications: An International Journal
Data-driven multi-touch attribution models
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Classification with support hyperplanes
ECML'06 Proceedings of the 17th European conference on Machine Learning
Facing the spammers: A very effective approach to avoid junk e-mails
Expert Systems with Applications: An International Journal
MDAI'06 Proceedings of the Third international conference on Modeling Decisions for Artificial Intelligence
Experiment databases: a novel methodology for experimental research
KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Two New Prediction-Driven Approaches to Discrete Choice Prediction
ACM Transactions on Management Information Systems (TMIS)
Customer event history for churn prediction: How long is long enough?
Expert Systems with Applications: An International Journal
Prediction of learning curves in machine translation
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Communications of the ACM
Bias-variance analysis in estimating true query model for information retrieval
Information Processing and Management: an International Journal
Quite a mess in my cookie jar!: leveraging machine learning to protect web authentication
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.04 |
Tree induction and logistic regression are two standard, off-the-shelf methods for building models for classification. We present a large-scale experimental comparison of logistic regression and tree induction, assessing classification accuracy and the quality of rankings based on class-membership probabilities. We use a learning-curve analysis to examine the relationship of these measures to the size of the training set. The results of the study show several things. (1) Contrary to some prior observations, logistic regression does not generally outperform tree induction. (2) More specifically, and not surprisingly, logistic regression is better for smaller training sets and tree induction for larger data sets. Importantly, this often holds for training sets drawn from the same domain (that is, the learning curves cross), so conclusions about induction-algorithm superiority on a given domain must be based on an analysis of the learning curves. (3) Contrary to conventional wisdom, tree induction is effective at producing probability-based rankings, although apparently comparatively less so for a given training-set size than at making classifications. Finally, (4) the domains on which tree induction and logistic regression are ultimately preferable can be characterized surprisingly well by a simple measure of the separability of signal from noise.