Sequential genetic search for ensemble feature selection

Authors:
Alexey Tsymbal;Mykola Pechenizkiy;Pádraig Cunningham
Affiliations:
Dept. of Computer Science, Trinity College Dublin, Dublin 2, Ireland;Dept. of CS & ISs, University of Jyvääskylä, Finland;Dept. of Computer Science, Trinity College Dublin, Dublin 2, Ireland
Venue:
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Year:
2005

Citing 9
Cited 8

Genetic algorithm for feature selection for parallel classifiers

Information Processing Letters
Technical Note: Selecting a Classification Method by Cross-Validation

Machine Learning
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature selection for ensembles

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
A streaming ensemble algorithm (SEA) for large-scale classification

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Data Mining using MLC++, A Machine Learning Library in C++

ICTAI '96 Proceedings of the 8th International Conference on Tools with Artificial Intelligence
Designing classifier fusion systems by genetic algorithms

IEEE Transactions on Evolutionary Computation

Dynamic integration of classifiers for handling concept drift

Information Fusion
Using the RRT algorithm to optimize classification systems for handwritten digits and letters

Proceedings of the 2008 ACM symposium on Applied computing
Combining Classifiers through Triplet-Based Belief Functions

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Overfitting cautious selection of classifier ensembles with genetic algorithms

Information Fusion
A fusion of stacking with dynamic integration

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Computer-aided diagnosis of pulmonary nodules using a two-step approach for feature selection and classifier ensemble construction

Artificial Intelligence in Medicine
Dynamic integration with random forests

ECML'06 Proceedings of the 17th European conference on Machine Learning
Analysis of the effectiveness of G3PARM algorithm

HAIS'10 Proceedings of the 5th international conference on Hybrid Artificial Intelligence Systems - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ensemble learning constitutes one of the main directions in machine learning and data mining. Ensembles allow us to achieve higher accuracy, which is often not achievable with single models. One technique, which proved to be effective for constructing an ensemble of diverse classifiers, is the use of feature subsets. Among different approaches to ensemble feature selection, genetic search was shown to perform best in many domains. In this paper, a new strategy GAS-SEFS, Genetic Algorithmbased Sequential Search for Ensemble Feature Selection, is introduced. Instead of one genetic process, it employs a series of processes, the goal of each of which is to build one base classifier. Experiments on 21 data sets are conducted, comparing the new strategy with a previously considered genetic strategy for different ensemble sizes and for five different ensemble integration methods. The experiments show that GAS-SEFS, although being more time-consuming, often builds better ensembles, especially on data sets with larger numbers of features.