Workload sampling for enterprise search evaluation

  • Authors:
  • Tom Rowlands; David Hawking; Ramesh Sankaranarayana

  • Affiliations:
  • Australian National University and CSIRO Australia, Canberra, Australia; CSIRO, Canberra, Australia; Australian National University, Canberra, Australia

  • Venue:
  • SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2007

Abstract

In real-world use of test collection methods, it is essential that the query test set be representative of the workload expected in the actual application. Using a random sample of queries from a media company's query log as a 'gold standard' test set, we demonstrate that biases in sitemap-derived and top-n query sets can lead to significant perturbations in engine rankings and large differences in estimated performance levels.
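
The contrast between a workload-representative sample and a top-n query set can be illustrated with a minimal sketch. This is not the authors' procedure: the function names (`random_sample`, `top_n`, `rank_engines`), the log format (a list of query strings, one per submitted query), and the use of a mean per-query score are all assumptions made for illustration.

```python
import random
from collections import Counter


def random_sample(query_log, k, seed=0):
    """Draw k query instances uniformly from the log (hypothetical helper).

    Repeated queries appear repeatedly in the log, so a frequent query
    is proportionally more likely to be drawn, as in the real workload.
    """
    rng = random.Random(seed)
    return rng.sample(query_log, k)


def top_n(query_log, n):
    """Return the n most frequent distinct queries (hypothetical helper)."""
    return [query for query, _ in Counter(query_log).most_common(n)]


def rank_engines(scores, queries):
    """Rank engines by mean per-query score over a query set.

    `scores` maps engine name -> {query: score}; this layout is an assumption.
    """
    def mean_score(engine):
        return sum(scores[engine][q] for q in queries) / len(queries)

    return sorted(scores, key=mean_score, reverse=True)
```

Ranking the same engines once over a random sample and once over a top-n set, then comparing the two orderings, is one simple way to see whether the choice of query set changes the outcome.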