Validating query simulators: an experiment using commercial searches and purchases

  • Authors:
  • Bouke Huurnink;Katja Hofmann;Maarten De Rijke;Marc Bron

  • Affiliations:
  • ISLA, University of Amsterdam, The Netherlands;ISLA, University of Amsterdam, The Netherlands;ISLA, University of Amsterdam, The Netherlands;ISLA, University of Amsterdam, The Netherlands

  • Venue:
  • CLEF'10 Proceedings of the 2010 international conference on Multilingual and multimodal information access evaluation: cross-language evaluation forum
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We design and validate simulators for generating queries and relevance judgments for retrieval system evaluation. We develop a simulation framework that incorporates existing and new simulation strategies. To validate a simulator, we assess whether evaluation using its output data ranks retrieval systems in the same way as evaluation using real-world data. The real-world data is obtained using logged commercial searches and associated purchase decisions. While no simulator reproduces an ideal ranking, there is a large variation in simulator performance that allows us to distinguish those that are better suited to creating artificial testbeds for retrieval experiments. Incorporating knowledge about document structure in the query generation process helps create more realistic simulators.