A data mining-based subset selection for enhanced discrimination using iterative elimination of redundancy

Authors:
Hyun-Woo Cho
Affiliations:
Department of Industrial and Information Engineering, University of Tennessee, Knoxville, TN 37996, USA
Venue:
Expert Systems with Applications: An International Journal
Year:
2009

Abstract

Redundant or irrelevant features in data mining can mask underlying patterns, so the number of features is often reduced with a feature selection technique. The objective of feature selection is to find the feature subset that gives the best performance. This work proposes a new feature selection method that combines orthogonal filtering with a nonlinear representation of the data to enhance discrimination performance. Orthogonal filtering is applied to remove unwanted variation from the data. Kernel principal component analysis, a nonlinear kernel method, is used to extract nonlinear characteristics of the data and to reduce its dimensionality. Features are selected by iterative backward elimination, using the selection criterion of linear discriminant analysis. The performance of the proposed method is compared with that of three other methods, and the results show that it outperforms all three. The use of filtering together with a kernel method is shown to be a promising approach to efficient feature selection.
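
The abstract names the building blocks but gives no algorithmic detail, so the following is only a minimal sketch of how backward feature elimination could be driven by a discriminant criterion computed on kernel PCA scores. It is not the author's implementation. Everything specific is an assumption made for illustration: the RBF kernel and its `gamma`, the number of components, the trace form of the Fisher criterion, the ridge term, and the function names `kpca_scores`, `fisher_criterion`, and `backward_eliminate`. The orthogonal filtering step is omitted because the abstract does not specify it.

```python
import numpy as np


def rbf_kernel(X, gamma):
    # Pairwise squared Euclidean distances between rows of X.
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))


def kpca_scores(X, n_components, gamma):
    # Kernel PCA on the training data: center the kernel matrix in
    # feature space, then take the leading eigenvectors.
    n = X.shape[0]
    K = rbf_kernel(X, gamma)
    J = np.eye(n) - np.ones((n, n)) / n
    Kc = J @ K @ J
    vals, vecs = np.linalg.eigh(Kc)  # eigenvalues in ascending order
    idx = np.argsort(vals)[::-1][:n_components]
    vals = np.maximum(vals[idx], 1e-12)
    # Scores of the training points are eigenvector * sqrt(eigenvalue).
    return vecs[:, idx] * np.sqrt(vals)


def fisher_criterion(T, y, ridge=1e-6):
    # Assumed criterion: trace(Sw^-1 Sb), the ratio of between-class to
    # within-class scatter that linear discriminant analysis maximizes.
    d = T.shape[1]
    mean_all = T.mean(axis=0)
    Sw = np.zeros((d, d))
    Sb = np.zeros((d, d))
    for c in np.unique(y):
        Tc = T[y == c]
        mc = Tc.mean(axis=0)
        Sw += (Tc - mc).T @ (Tc - mc)
        diff = (mc - mean_all)[:, None]
        Sb += len(Tc) * (diff @ diff.T)
    Sw += ridge * np.eye(d)  # small ridge keeps Sw invertible
    return float(np.trace(np.linalg.solve(Sw, Sb)))


def backward_eliminate(X, y, n_keep, n_components=2, gamma=0.5):
    # Repeatedly drop the feature whose removal leaves the highest
    # criterion value, until n_keep features remain.
    remaining = list(range(X.shape[1]))
    while len(remaining) > n_keep:
        best_score, drop = -np.inf, None
        for f in remaining:
            trial = [j for j in remaining if j != f]
            T = kpca_scores(X[:, trial], n_components, gamma)
            score = fisher_criterion(T, y)
            if score > best_score:
                best_score, drop = score, f
        remaining.remove(drop)
    return remaining
```

Calling `backward_eliminate(X, y, n_keep=k)` with a samples-by-features array `X` and integer class labels `y` returns the column indices of the `k` retained features. Each elimination step refits kernel PCA once per candidate feature, so the cost grows quickly with the number of features; a practical version would likely need a cheaper ranking step.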