A competitive ensemble pruning approach based on cross-validation technique

Authors:
Qun Dai
Affiliations:
Institute of Computer Science and Technology, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China
Venue:
Knowledge-Based Systems
Year:
2013

Citing 27
Cited 3

The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Ensembling neural networks: many could be better than all

Artificial Intelligence
Neural Network Ensembles

IEEE Transactions on Pattern Analysis and Machine Intelligence
Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy

Machine Learning
On the Boosting Pruning Problem

ECML '00 Proceedings of the 11th European Conference on Machine Learning
Ensemble Methods in Machine Learning

MCS '00 Proceedings of the First International Workshop on Multiple Classifier Systems
Pruning Adaptive Boosting

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Pruning and dynamic scheduling of cost-sensitive ensembles

Eighteenth national conference on Artificial intelligence
Improved CBP Neural Network Model with Applications in Time Series Prediction

Neural Processing Letters
Ensemble selection from libraries of models

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Pruning in ordered bagging ensembles

ICML '06 Proceedings of the 23rd international conference on Machine learning
Using diversity of errors for selecting members of a committee classifier

Pattern Recognition
Using boosting to prune bagging ensembles

Pattern Recognition Letters
Selective fusion of heterogeneous classifiers

Intelligent Data Analysis
Ensemble Pruning Via Semi-definite Programming

The Journal of Machine Learning Research
EROS: Ensemble rough subspaces

Pattern Recognition
Unsupervised data pruning for clustering of noisy data

Knowledge-Based Systems
Pruning an ensemble of classifiers via reinforcement learning

Neurocomputing
Selective ensemble of decision trees

RSFDGrC'03 Proceedings of the 9th international conference on Rough sets, fuzzy sets, data mining, and granular computing
An ensemble uncertainty aware measure for directed hill climbing ensemble pruning

Machine Learning
An efficient fuzzy weighted average algorithm for the military UAV selecting under group decision-making

Knowledge-Based Systems
An ensemble design of intrusion detection system for handling uncertainty using Neutrosophic Logic Classifier

Knowledge-Based Systems
Constructing rough decision forests

RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part II
Mixture of random prototype-based local experts

HAIS'10 Proceedings of the 5th international conference on Hybrid Artificial Intelligence Systems - Volume Part I
Lung cancer cell identification based on artificial neural network ensembles

Artificial Intelligence in Medicine
The build of n-Bits Binary Coding ICBP Ensemble System

Neurocomputing
Stability problems with artificial neural networks and the ensemble solution

Artificial Intelligence in Medicine

A novel ensemble pruning algorithm based on randomized greedy selective strategy and ballot

Neurocomputing
A survey of multiple classifier systems as hybrid systems

Information Fusion
Clustering-based ensembles for one-class classification

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ensemble pruning is crucial for the consideration of both efficiency and predictive accuracy of an ensemble system. This paper proposes a new Competitive technique for Ensemble Pruning based on Cross-Validation (CEPCV). The data to be learnt by neural computing models are mostly drifting with time and environment, therefore a dynamic ensemble pruning method is indispensable for practical applications, while the proposed CEPCV method is just the kind of dynamic ensemble pruning method, which can realize on-line ensemble pruning and take full advantage of potentially valuable information. The algorithm naturally inherits the predominance of cross-validation technique, which implies that those networks regarded as winners in selective competitions and chosen into the pruned ensemble have the ''strongest'' generalization capability. It is essentially based on the strategy of ''divide and rule, collect the wisdom'', and might alleviate the local minima problem of many conventional ensemble pruning approaches only at the cost of a little greater computational cost, which is acceptable to most applications of ensemble learning. The comparative experiments among the four ensemble pruning algorithms, including: CEPCV and the state-of-the-art Directed Hill Climbing Ensemble Pruning (DHCEP) algorithm and two baseline methods, i.e. BSM, which chooses the Best Single Model in the initial ensemble based on their performances on the pruning set, and ALL, which reserves all network members of the initial ensemble, on ten benchmark classification tasks, demonstrate the effectiveness and validity of CEPCV.