A selective sampling strategy for label ranking

  • Authors:
  • Massih Amini;Nicolas Usunier;François Laviolette;Alexandre Lacasse;Patrick Gallinari

  • Affiliations:
  • Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, Paris, France;Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, Paris, France;Département IFT-GLO, Université Laval, Sainte-Foy (QC), Canada;Département IFT-GLO, Université Laval, Sainte-Foy (QC), Canada;Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, Paris, France

  • Venue:
  • ECML'06 Proceedings of the 17th European conference on Machine Learning
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a novel active learning strategy based on the compression framework of [9] for label ranking functions which, given an input instance, predict a total order over a predefined set of alternatives. Our approach is theoretically motivated by an extension to ranking and active learning of Kääriäinen's generalization bounds using unlabeled data [7], initially developed in the context of classification. The bounds we obtain suggest a selective sampling strategy provided that a sufficiently, yet reasonably large initial labeled dataset is provided. Experiments on Information Retrieval corpora from automatic text summarization and question/answering show that the proposed approach allows to substantially reduce the labeling effort in comparison to random and heuristic-based sampling strategies.