An introduction to variable and feature selection

  • Authors:
  • Isabelle Guyon; André Elisseeff

  • Affiliations:
  • Clopinet, 955 Creston Road, Berkeley, CA; Empirical Inference for Machine Learning and Perception Department, Max Planck Institute for Biological Cybernetics, Spemannstrasse 38, 72076 Tübingen, Germany

  • Venue:
  • The Journal of Machine Learning Research
  • Year:
  • 2003

Abstract

Variable and feature selection have become the focus of much research in areas of application for which datasets with tens or hundreds of thousands of variables are available. These areas include text processing of internet documents, gene expression array analysis, and combinatorial chemistry. The objective of variable selection is three-fold: improving the prediction performance of the predictors, providing faster and more cost-effective predictors, and providing a better understanding of the underlying process that generated the data. The contributions of this special issue cover a wide range of aspects of such problems: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.