Fast feature selection aimed at high-dimensional data via hybrid-sequential-ranked searches

  • Authors:
  • R. Ruiz;J. C. Riquelme;J. S. Aguilar-Ruiz;M. García-Torres

  • Affiliations:
  • School of Engineering, Pablo de Olavide University, Ctra. Utrera km. 1, 41013 Seville, Spain;Department of Computer Science, University of Seville, Avda. Reina Mercedes s/n, 41012 Seville, Spain;School of Engineering, Pablo de Olavide University, Ctra. Utrera km. 1, 41013 Seville, Spain;School of Engineering, Pablo de Olavide University, Ctra. Utrera km. 1, 41013 Seville, Spain

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

We address the feature subset selection problem for classification tasks. We examine the performance of two hybrid strategies that directly search on a ranked list of features and compare them with two widely used algorithms, the fast correlation based filter (FCBF) and sequential forward selection (SFS). The proposed hybrid approaches provide the possibility of efficiently applying any subset evaluator, with a wrapper model included, to large and high-dimensional domains. The experiments performed show that our two strategies are competitive and can select a small subset of features without degrading the classification error or the advantages of the strategies under study.