Stepwise feature selection using generalized logistic loss

  • Authors:
  • Changyi Park;Ja-Yong Koo;Peter T. Kim;Jae Won Lee

  • Affiliations:
  • Department of Statistics, University of Seoul, Seoul 130-743, Republic of Korea;Department of Statistics, Korea University, Seoul 136-701, Republic of Korea;Department of Mathematics and Statistics, University of Guelph, Guelph, Ontario, Canada;Department of Statistics, Korea University, Seoul 136-701, Republic of Korea

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2008

Quantified Score

Hi-index 0.03

Visualization

Abstract

Microarray experiments have raised challenging questions such as how to make an accurate identification of a set of marker genes responsible for various cancers. In statistics, this specific task can be posed as the feature selection problem. Since a support vector machine can deal with a vast number of features, it has gained wide spread use in microarray data analysis. We propose a stepwise feature selection using the generalized logistic loss that is a smooth approximation of the usual hinge loss. We compare the proposed method with the support vector machine with recursive feature elimination for both real and simulated datasets. It is illustrated that the proposed method can improve the quality of feature selection through standardization while the method retains similar predictive performance compared with the recursive feature elimination.