Select-the-Best-Ones: A new way to judge relative relevance

  • Authors:
  • Ruihua Song;Qingwei Guo;Ruochi Zhang;Guomao Xin;Ji-Rong Wen;Yong Yu;Hsiao-Wuen Hon

  • Affiliations:
  • Shanghai Jiao Tong University, Shanghai 200240, China and Microsoft Research Asia, Beijing 100190, China;Microsoft Research Asia, Beijing 100190, China;Microsoft Research Asia, Beijing 100190, China;Microsoft Research Asia, Beijing 100190, China;Microsoft Research Asia, Beijing 100190, China;Shanghai Jiao Tong University, Shanghai 200240, China;Microsoft Research Asia, Beijing 100190, China

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the traditional evaluation of information retrieval systems, assessors are asked to determine the relevance of a document on a graded scale, independent of any other documents. Such judgments are absolute judgments. Learning to rank brings some new challenges to this traditional evaluation methodology, especially regarding absolute relevance judgments. Recently preferences judgments have been investigated as an alternative. Instead of assigning a relevance grade to a document, an assessor looks at a pair of pages and judges which one is better. In this paper, we generalize pairwise preference judgments to relative judgments. We formulate the problem of relative judgments in a formal way and then propose a new strategy called Select-the-Best-Ones to solve the problem. Through user studies, we compare our proposed method with a pairwise preference judgment method and an absolute judgment method. The results indicate that users can distinguish by about one more relevance degree when using relative methods than when using the absolute method. Consequently, the relative methods generate 15-30% more document pairs for learning to rank. Compared to the pairwise method, our proposed method increases the agreement among assessors from 95% to 99%, while halving the labeling time and the number of discordant pairs to experts' judgments.