Feature Selection for Ensembles of Simple Bayesian Classifiers

Authors:
Alexey Tsymbal;Seppo Puuronen;David Patterson
Affiliations:
-;-;-
Venue:
ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
Year:
2002

Citing 10
Cited 2

On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature selection for ensembles

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Neural Network Ensembles

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Dynamic Integration Algorithm for an Ensemble of Classifiers

ISMIS '99 Proceedings of the 11th International Symposium on Foundations of Intelligent Systems
Bagging and the Random Subspace Method for Redundant Feature Spaces

MCS '01 Proceedings of the Second International Workshop on Multiple Classifier Systems
Data Mining using MLC++, A Machine Learning Library in C++

ICTAI '96 Proceedings of the 8th International Conference on Tools with Artificial Intelligence
A simple approach to building ensembles of Naive Bayesian classifiers for word sense disambiguation

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Local Feature Selection with Dynamic Integration of Classifiers

Fundamenta Informaticae - Intelligent Systems

A neural network-based framework for the reconstruction of incomplete data sets

Neurocomputing
Search strategies for ensemble feature selection in medical diagnostics

CBMS'03 Proceedings of the 16th IEEE conference on Computer-based medical systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A popular method for creating an accurate classifier from a set of training data is to train several classifiers, and then to combine their predictions. The ensembles of simple Bayesian classifiers have traditionally not been a focus of research. However, the simple Bayesian classifier has much broader applicability than previously thought. Besides its high classification accuracy, it also has advantages in terms of simplicity, learning speed, classification speed, storage space, and incrementality. One way to generate an ensemble of simple Bayesian classifiers is to use different feature subsets as in the random subspace method. In this paper we present a technique for building ensembles of simple Bayesian classifiers in random subspaces. We consider also a hill-climbing-based refinement cycle, which improves accuracy and diversity of the base classifiers. We conduct a number of experiments on a collection of real-world and synthetic data sets. In many cases the ensembles of simple Bayesian classifiers have significantly higher accuracy than the single "global" simple Bayesian classifier. We consider several methods for integration of simple Bayesian classifiers. The dynamic integration better utilizes ensemble diversity than the static integration.