SVM based feature selection: why are we using the dual?

  • Authors:
  • Guillermo L. Grinblat;Javier Izetta;Pablo M. Granitto

  • Affiliations:
  • French Argentine International Center for Information and Systems Sciences, France and UNR-CONICET, Rosario, Argentina;French Argentine International Center for Information and Systems Sciences, France and UNR-CONICET, Rosario, Argentina;French Argentine International Center for Information and Systems Sciences, France and UNR-CONICET, Rosario, Argentina

  • Venue:
  • IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most Support Vector Machines (SVM) implementations are based on solving the dual optimization problem. Of course, feature selection algorithms based on SVM are not different and, in particular, the most used method in the area, Guyon et al.'s Recursive Feature Elimination (SVM-RFE) is also based on the dual problem. However, this is just one of the options available to find a solution to the original SVM optimization problem. In this work we discuss some potential problems that arise when ranking features with the dual-based version of SVM-RFE and propose a primal-based version of this well-known method, PSVM-RFE. We show that our new method is able to produce a better detection of relevant features, in particular in situations involving non-linear decision boundaries. Using several artificial and real-world datasets we compare both versions of SVM-RFE, finding that PSVM-RFE is preferable in most situations.