A statistical model of query log generation

  • Authors:
  • Georges Dupret;Benjamin Piwowarski;Carlos Hurtado;Marcelo Mendoza

  • Affiliations:
  • Yahoo! Research Latin America;Yahoo! Research Latin America;Departamento de Ciencias de la Computación, Universidad de Chile;Departamento de Ciencias de la Computación, Universidad de Chile

  • Venue:
  • SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Query logs record past query sessions across a time span. A statistical model is proposed to explain the log generation process. Within a search engine list of results, the model explains the document selection – a user’s click – by taking into account both a document position and its popularity. We show that it is possible to quantify this influence and consequently estimate document “un-biased” popularities. Among other applications, this allows to re-order the result list to match more closely user preferences and to use the logs as a feedback to improve search engines.