Constraint scores for semi-supervised feature selection: A comparative study

  • Authors:
  • Mariam Kalakech;Philippe Biela;Ludovic Macaire;Denis Hamad

  • Affiliations:
  • HEI, 13 rue de Toul, F-59046 Lille, France and LAGIS FRE CNRS 3303, Université Lille 1, Bítiment P2, Cité Scientifique, F-59655 Villeneuve d'Ascq, France;HEI, 13 rue de Toul, F-59046 Lille, France and LAGIS FRE CNRS 3303, Université Lille 1, Bítiment P2, Cité Scientifique, F-59655 Villeneuve d'Ascq, France;LAGIS FRE CNRS 3303, Université Lille 1, Bítiment P2, Cité Scientifique, F-59655 Villeneuve d'Ascq, France;LISIC, ULCO, 50 rue Ferdinand Buisson, F-62228 Calais, France

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2011

Quantified Score

Hi-index 0.10

Visualization

Abstract

Recent feature selection scores using pairwise constraints (must-link and cannot-link) have shown better performances than the unsupervised methods and comparable to the supervised ones. However, these scores use only the pairwise constraints and ignore the available information brought by the unlabeled data. Moreover, these constraint scores strongly depend on the given must-link and cannot-link subsets built by the user. In this paper, we address these problems and propose a new semi-supervised constraint score that uses both pairwise constraints and local properties of the unlabeled data. Experiments using Kendall's coefficient and accuracy rates, show that this new score is less sensitive to the given constraints than the previous scores while providing similar performances.