Genetic wrappers for feature selection in decision tree induction and variable ordering in Bayesian network structure learning

Authors:
William H. Hsu
Affiliations:
Department of Computing and Information Sciences, Kansas State University, Manhattan, KS and National Center for Supercomputing Applications (NCSA), University of Illinois, Champaign, IL
Venue:
Information Sciences: an International Journal - Special issue: Soft computing data mining
Year:
2004

Citing 17
Cited 15

Classifier systems and genetic algorithms

Artificial Intelligence
Probabilistic reasoning in expert systems: theory and algorithms

Probabilistic reasoning in expert systems: theory and algorithms
The computational complexity of probabilistic inference using Bayesian belief networks (research note)

Artificial Intelligence
A Bayesian Method for the Induction of Probabilistic Networks from Data

Machine Learning
Using Genetic Algorithms for Concept Learning

Machine Learning - Special issue on genetic algorithms
Learning Bayesian Networks: The Combination of Knowledge and Statistical Data

Machine Learning
Structure Learning of Bayesian Networks by Genetic Algorithms: A Performance Analysis of Control Parameters

IEEE Transactions on Pattern Analysis and Machine Intelligence
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
Practical genetic algorithms

Practical genetic algorithms
Using Bayesian networks to analyze expression data

RECOMB '00 Proceedings of the fourth annual international conference on Computational molecular biology
Change of Representation and Inductive Bias

Change of Representation and Inductive Bias
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Machine Learning

Machine Learning
Learning the Dimensionality of Hidden Variables

UAI '01 Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence
Control of inductive bias in supervised learning using evolutionary computation: a wrapper-based approach

Data mining
AIS-BN: an adaptive importance sampling algorithm for evidential reasoning in large Bayesian networks

Journal of Artificial Intelligence Research
Real time estimation of Bayesian networks

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence

Evaluation and classification of otoneurological data with new data analysis methods based on machine learning

Information Sciences: an International Journal
Towards efficient variables ordering for Bayesian networks classifier

Data & Knowledge Engineering
Bayesian networks for imputation in classification problems

Journal of Intelligent Information Systems
Genetic algorithm-based feature set partitioning for classification problems

Pattern Recognition
Genetic algorithm-based feature set partitioning for classification problems

Pattern Recognition
Mining manufacturing data using genetic algorithm-based feature set decomposition

International Journal of Intelligent Systems Technologies and Applications
Estimating software readiness using predictive models

Information Sciences: an International Journal
Attributes reduction based on GA-CFS method

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Evolving decision tree rule based system for audio stego anomalies detection based on Hausdorff distance statistics

Information Sciences: an International Journal
mr2PSO: A maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification

Information Sciences: an International Journal
Evolutionary attribute ordering in Bayesian networks for predicting the metabolic syndrome

Expert Systems with Applications: An International Journal
A new polynomial time algorithm for bayesian network structure learning

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Comparison of metaheuristic strategies for peakbin selection in proteomic mass spectrometry data

Information Sciences: an International Journal
Decision support analysis for safety control in complex project environments based on Bayesian Networks

Expert Systems with Applications: An International Journal
Feature selection for classification of animal feed ingredients from near infrared microscopy spectra

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we address the automated tuning of input specification for supervised inductive learning and develop combinatorial optimization solutions for two such tuning problems. First, we present a framework for selection and reordering of input variables to reduce generalization error in classification and probabilistic inference. One purpose of selection is to control overfitting using validation set accuracy as a criterion for relevance. Similarly, some inductive learning algorithms, such as greedy algorithms for learning probabilistic networks, are sensitive to the evaluation order of variables. We design a generic fitness function for validation of input specification, then use it to develop two genetic algorithm wrappers: one for the variable selection problem for decision tree inducers and one for the variable ordering problem for Bayesian network structure learning. We evaluate the wrappers, using real-world data for the selection wrapper and synthetic data for both, and discuss their limitations and generalizability to other inducers.