This paper provides insights into the advantages and disadvantages of two kinds of ensemble models: ensembles based on sampling and ensembles based on feature selection. Experimental results confirm that both methods produce robust ensembles and significantly improve on the prediction performance of single classifiers, at the cost of interpretability and additional computing resources. In particular, classifiers that utilize prior class distributions, such as support vector machines and naive Bayesian classifiers, benefit only marginally from ensembles, while classifiers with higher variance, such as neural networks and tree learners, form strong ensembles. Further, when feature selection is used to create ensembles, there seems to be an optimal ratio of selected input variables that maximizes ensemble performance while minimizing computational cost. Finally, we show that most evaluation methods become useless when models are compared on data sets with very skewed class distributions.
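
As a rough illustration of the two ensemble construction strategies named in the abstract, the sketch below shows how the training view of each ensemble member might be drawn: by resampling rows (sampling-based) or by keeping a fixed ratio of the input variables (feature-selection-based). The abstract does not specify the paper's actual procedure, so the function names, the bootstrap scheme, the ratio of 0.5, and the plurality vote used to combine predictions are all assumptions made for illustration.

```python
import random


def bootstrap_rows(n_rows, rng):
    """Sampling-based member: draw row indices with replacement (assumed bootstrap)."""
    return [rng.randrange(n_rows) for _ in range(n_rows)]


def random_feature_subset(n_features, ratio, rng):
    """Feature-selection-based member: keep a fraction `ratio` of the feature indices."""
    k = max(1, round(ratio * n_features))  # always keep at least one feature
    return sorted(rng.sample(range(n_features), k))


def majority_vote(predictions):
    """Combine one predicted label per member by plurality vote (assumed combiner)."""
    counts = {}
    for label in predictions:
        counts[label] = counts.get(label, 0) + 1
    return max(counts, key=counts.get)


# Hypothetical sizes, chosen only to keep the example small.
rng = random.Random(0)
n_rows, n_features, n_members = 10, 8, 5

# One training view per ensemble member under each strategy.
row_samples = [bootstrap_rows(n_rows, rng) for _ in range(n_members)]
feature_subsets = [random_feature_subset(n_features, 0.5, rng) for _ in range(n_members)]

# Combining five made-up member predictions for a single test instance.
vote = majority_vote([1, 0, 1, 1, 0])
```

In this sketch the `ratio` argument plays the role of the ratio of selected input variables discussed in the abstract: a smaller ratio gives each member fewer features to train on and so lowers its cost, and the abstract suggests some intermediate value balances cost against ensemble performance.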