A search engine based on query logs, and search log analysis by automatic language identification

  • Authors:
  • Michael Oakes;Yan Xu

  • Affiliations:
  • University of Sunderland, Dept. of Computing, Engineering and Technology, DGIC, Sunderland, England;University of Sunderland, Dept. of Computing, Engineering and Technology, DGIC, Sunderland, England

  • Venue:
  • CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
  • Year:
  • 2009

Abstract

This work describes a variation on the traditional Information Retrieval paradigm in which text documents are indexed not according to their content, but according to the search terms previous users have used to find them. We determine the effectiveness of this approach by indexing a sample of query logs from the European Library, and describe its usefulness for multilingual searching. In our analysis of the search logs, we determine the language of past queries automatically and annotate the logs accordingly. From this information, we derive matrices showing that users tend to a) persist with the same query language throughout a query session, and b) submit queries in the same language as the interface they have selected.
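
The indexing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the log format, the record identifiers, the whitespace tokenisation, and the count-based ranking are all assumptions made for illustration. Each record is indexed by the terms of past queries that led to it, so a record reached through queries in several languages becomes retrievable in any of them.

```python
from collections import Counter, defaultdict

# Hypothetical log entries: (query string, id of the record the user then viewed).
# Real European Library logs have a different, richer format.
LOG = [
    ("mozart letters", "rec1"),
    ("lettres de mozart", "rec1"),
    ("mozart briefe", "rec1"),
    ("dante inferno", "rec2"),
    ("divina commedia", "rec2"),
]

def build_index(log):
    """Map each query term to a count of how often it led to each record."""
    index = defaultdict(Counter)
    for query, record_id in log:
        for term in query.lower().split():
            index[term][record_id] += 1
    return index

def search(index, query):
    """Rank records by the summed counts of the query terms that reached them."""
    scores = Counter()
    for term in query.lower().split():
        # .get avoids creating empty entries for unseen terms
        scores.update(index.get(term, {}))
    return [record_id for record_id, _ in scores.most_common()]

index = build_index(LOG)
german_hits = search(index, "briefe")
french_hits = search(index, "lettres")
unseen_hits = search(index, "beethoven")
```

In this toy log, the German term "briefe" and the French term "lettres" both retrieve the same record, although neither word needs to appear in the record itself. A term that no earlier user has typed retrieves nothing, which is the main limitation of relying on logs alone.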