Efficient query evaluation using a two-level retrieval process

  • Authors:
  • Andrei Z. Broder;David Carmel;Michael Herscovici;Aya Soffer;Jason Zien

  • Affiliations:
  • IBM Watson Research Center;IBM Research Lab in Haifa, Haifa, ISRAEL;IBM Research Lab in Haifa, Haifa, ISRAEL;IBM Research Lab in Haifa, Haifa, ISRAEL;IBM Almaden Research Center

  • Venue:
  • CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an efficient query evaluation method based on a two level approach: at the first level, our method iterates in parallel over query term postings and identifies candidate documents using an approximate evaluation taking into account only partial information on term occurrences and no query independent factors; at the second level, promising candidates are fully evaluated and their exact scores are computed. The efficiency of the evaluation process can be improved significantly using dynamic pruning techniques with very little cost in effectiveness. The amount of pruning can be controlled by the user as a function of time allocated for query evaluation. Experimentally, using the TREC Web Track data, we have determined that our algorithm significantly reduces the total number of full evaluations by more than 90%, almost without any loss in precision or recall. At the heart of our approach there is an efficient implementation of a new Boolean construct called WAND or Weak AND that might be of independent interest.