Pairwise feature evaluation for constructing reduced representations

  • Authors:
  • Artsiom Harol;Carmen Lai;Elżbieta Pękalska;Robert P. W. Duin

  • Affiliations:
  • af1 Delft University of Technology, Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft, The Netherlands;af1 Delft University of Technology, Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft, The Netherlands;Delft University of Technology, Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft, The Netherlands and University of Manchester ...;Delft University of Technology, Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft, The Netherlands

  • Venue:
  • Pattern Analysis & Applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Feature selection methods are often used to determine a small set of informative features that guarantee good classification results. Such procedures usually consist of two components: a separability criterion and a selection strategy. The most basic choices for the latter are individual ranking, forward search and backward search. Many intermediate methods such as floating search are also available. The forward as well as backward selection may cause lossy evaluation of the criterion and/or overtraining of the final classifier in case of high-dimensional spaces and small sample size problems. Backward selection may also become computationally prohibitive. Individual ranking, on the other hand, suffers as it neglects dependencies between features. A new strategy based on a pairwise evaluation has recently been proposed by Bo and Jonassen (Genome Biol 3, 2002) and Pękalska et al. (International Conference on Computer Recognition Systems, Poland, pp 271–278, 2005). Since it considers interactions between features, but always restricted to two-dimensional spaces, it may circumvent the small sample size problem. In this paper, we evaluate this idea in a more general framework for the selection of features as well as prototypes. Our finding is that such a pairwise selection may improve over traditional procedures and we present some artificial and real-world examples to support this claim. Additionally, we have also discovered that the set of problems for which the pairwise selection may be effective is small.