Boosting and measuring the performance of ensembles for a successful database marketing

  • Authors:
  • YongSeog Kim

  • Affiliations:
  • MIS Department, Jon M. Huntsman School of Business, Utah State University, Logan, UT 84322, USA

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2009


Abstract

This paper examines the advantages and disadvantages of two ensemble approaches: ensembles based on sampling and ensembles based on feature selection. Experimental results confirm that both methods produce robust ensembles and significantly improve the predictive performance of single classifiers, at the cost of interpretability and additional computing resources. In particular, classifiers that exploit prior class distributions, such as the support vector machine and the naive Bayes classifier, benefit only marginally from ensembling, whereas high-variance classifiers, such as neural networks and tree learners, form strong ensembles. Further, when feature selection is used to create ensembles, there appears to be an optimal ratio of selected input variables that maximizes ensemble performance while minimizing computational cost. Finally, we show that most evaluation methods become uninformative when models are compared on data sets with highly skewed class distributions.
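The two ensemble-construction strategies the abstract contrasts can be sketched in pure Python. This is a minimal illustration, not the paper's implementation: the paper's base learners (neural networks, SVMs, naive Bayes, tree learners) are replaced here by a decision stump for brevity, and all function names, the `ratio` parameter, and the toy data are illustrative assumptions.

```python
import random

def train_stump(X, y, features):
    """Pick the single (feature, threshold, polarity) split with the
    lowest training error -- a deliberately weak, high-variance learner."""
    best = None
    for f in features:
        for t in sorted({row[f] for row in X}):
            for polarity in (False, True):
                preds = [(row[f] > t) == polarity for row in X]
                err = sum(int(p) != yi for p, yi in zip(preds, y))
                if best is None or err < best[0]:
                    best = (err, f, t, polarity)
    _, f, t, polarity = best
    return lambda row: int((row[f] > t) == polarity)

def bagging_ensemble(X, y, n_models=11, seed=0):
    """Sampling-based ensemble: each member trains on a bootstrap resample."""
    rng = random.Random(seed)
    n = len(X)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(train_stump([X[i] for i in idx],
                                  [y[i] for i in idx],
                                  range(len(X[0]))))
    return models

def subspace_ensemble(X, y, ratio=0.5, n_models=11, seed=0):
    """Feature-selection ensemble: each member trains on a random subset
    of the input variables; `ratio` is the fraction of features selected."""
    rng = random.Random(seed)
    d = len(X[0])
    k = max(1, round(ratio * d))
    return [train_stump(X, y, rng.sample(range(d), k))
            for _ in range(n_models)]

def vote(models, row):
    """Combine members by unweighted majority vote."""
    return int(sum(m(row) for m in models) > len(models) / 2)
```

On real data, one would sweep `ratio` over a grid to look for the cost/performance optimum the abstract describes, and compare the voted predictions of each ensemble against a single base learner.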