Bayes optimal metasearch: a probabilistic model for combining the results of multiple retrieval systems (poster session)

  • Authors:
  • Javed A. Aslam;Mark Montague

  • Affiliations:
  • Department of Computer Science, Dartmouth College, 6211 Sudikoff Laboratory, Hanover, NH;Department of Computer Science, Dartmouth College, 6211 Sudikoff Laboratory, Hanover, NH

  • Venue:
  • SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a new, probabilistic model for combining the outputs of an arbitrary number of query retrieval systems. By gathering simple statistics on the average performance of a given set of query retrieval systems, we construct a Bayes optimal mechanism for combining the outputs of these systems. Our construction yields a metasearch strategy whose empirical performance nearly always exceeds the performance of any of the constituent systems. Our construction is also robust in the sense that if “good” and “bad” systems are combined, the Performance of the composite is still on par with, or exceeds, that of the best constituent system. Finally, our model and theory provide theoretical and empirical avenues for the improvement of this metasearch strategy.