Bi-objective feature selection for discriminant analysis in two-class classification

Authors:
JoaquıN Pacheco;Silvia Casado;Francisco Angel-Bello;Ada ÁLvarez
Affiliations:
Departamento de Economıa Aplicada, Universidad de Burgos, Spain;Departamento de Economıa Aplicada, Universidad de Burgos, Spain;Instituto Tecnológico de Estudios Superiores de Monterrey, Campus Monterrey, Mexico;Universidad Autónoma de Nuevo León, Mexico
Venue:
Knowledge-Based Systems
Year:
2013

Citing 21
Cited 1

Feature subset selection by Bayesian network-based optimization

Artificial Intelligence
Feature Selection for Knowledge Discovery and Data Mining

Feature Selection for Knowledge Discovery and Data Mining
Evolutionary Algorithms for Solving Multi-Objective Problems

Evolutionary Algorithms for Solving Multi-Objective Problems
Prototype Selection and Feature Subset Selection by Estimation of Distribution Algorithms. A Case Study in the Survival of Cirrhotic Patients Treated with TIPS

AIME '01 Proceedings of the 8th Conference on AI in Medicine in Europe: Artificial Intelligence Medicine
Analysis of new variable selection methods for discriminant analysis

Computational Statistics & Data Analysis
Multi-objective Feature Selection with NSGA II

ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
Application of a niched Pareto genetic algorithm for selecting features for nuclear transients classification

International Journal of Intelligent Systems
Feature selection in bankruptcy prediction

Knowledge-Based Systems
A generic multi-dimensional feature extraction method using multiobjective genetic programming

Evolutionary Computation
Sensitivity and specificity based multiobjective approach for feature selection: Application to cancer diagnosis

Information Processing Letters
Parallel multiobjective memetic RBFNNs design and feature selection for function approximation problems

Neurocomputing
Multi-objective feature selection by using NSGA-II for customer churn prediction in telecommunications

Expert Systems with Applications: An International Journal
A multiobjective evolutionary approach to concurrently learn rule and data bases of linguistic fuzzy-rule-based systems

IEEE Transactions on Fuzzy Systems
Feature analysis and classification of protein secondary structure data

ICANN/ICONIP'03 Proceedings of the 2003 joint international conference on Artificial neural networks and neural information processing
Simple instance selection for bankruptcy prediction

Knowledge-Based Systems
Bankruptcy prediction models based on multinorm analysis: An alternative to accounting ratios

Knowledge-Based Systems
Supervised immune clonal evolutionary classification algorithm for high-dimensional data

Neurocomputing
Feature selection using rough entropy-based uncertainty measures in incomplete decision systems

Knowledge-Based Systems
NSGA-II-trained neural network approach to the estimation of prediction intervals of scale deposition rate in oil & gas equipment

Expert Systems with Applications: An International Journal
Feature selection using dynamic weights for classification

Knowledge-Based Systems
Feature selection based on cluster and variability analyses for ordinal multi-class classification problems

Knowledge-Based Systems

Repeated double cross-validation for choosing a single solution in evolutionary multi-objective fuzzy classifier design

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This works deals with the problem of selecting variables (features) that are subsequently used in discriminant analysis. The aim is to find, from a set of m variables, smaller subsets which enable an efficient classification of cases in two classes. We consider two objectives, each one associated with the misclassification error in each class (type I and type II errors). Thus, we establish a bi-objective problem and develop an algorithm based on the NSGA-II strategy to this specific problem, in order to obtain a set of non-dominated solutions. Managing these two objectives separately (and not jointly) allows an enhanced analysis of the obtained solutions by observing the approach to efficient frontier. This is especially significant when each type of error has a different level of importance or when they cannot be compared. To illustrate these issues, several known databases from literature are used, as well as an additional database with several Spanish firms featured by financial variables and two classes: ''creditworthy'' and ''non-creditworthy''. Finally, we show that when solutions obtained by our NSGA-II implementation are evaluated from the classic mono-objective perspective (minimizing the ratio of both error types jointly) they are better than those obtained by classic methods for feature selection and similar than those provided by other recently published methods.