Flexible sample selection strategies for transfer learning in ranking

  • Authors:
  • Kevin Duh;Akinori Fujino

  • Affiliations:
  • NTT Communication Science Laboratories, 2-4 Hikaridai, Keihanna Science City, Kyoto 619-0237, Japan;NTT Communication Science Laboratories, 2-4 Hikaridai, Keihanna Science City, Kyoto 619-0237, Japan

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2012
  • Cross-task crowdsourcing

    Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Quantified Score

Hi-index 0.00

Visualization

Abstract

Ranking is a central component in information retrieval systems; as such, many machine learning methods for building rankers have been developed in recent years. An open problem is transfer learning, i.e. how labeled training data from one domain/market can be used to build rankers for another. We propose a flexible transfer learning strategy based on sample selection. Source domain training samples are selected if the functional relationship between features and labels do not deviate much from that of the target domain. This is achieved through a novel application of recent advances from density ratio estimation. The approach is flexible, scalable, and modular. It allows many existing supervised rankers to be adapted to the transfer learning setting. Results on two datasets (Yahoo's Learning to Rank Challenge and Microsoft's LETOR data) show that the proposed method gives robust improvements.