Experiments in Query Paraphrasing for Information Retrieval

  • Authors:
  • Ingrid Zukerman;Bhavani Raskutti;Yingying Wen

  • Affiliations:
  • -;-;-

  • Venue:
  • AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate the effect of paraphrase generation on document retrieval performance. Specifically, we describe experiments where three information sources are used to generate lexical paraphrases of queries posed to the Internet. These information sources are: WordNet, a Webster-based thesaurus, and a combination of Webster and WordNet. Corpus-based information and wordsimilarity information are then used to rank the paraphrases. We evaluated our mechanism using 404 queries whose answers reside in the LA Times subset of the TREC-9 corpus. Our experiments show that query paraphrasing improves retrieval performance, and that performance is influenced both by the number of paraphrases generated for a query and by their quality. Specifically, the best performance was obtained usingWordNet, which improves document recall by 14% and increases the number of questions that can be answered by 8%.