Probabilistic question answering on the Web: Research Articles

  • Authors:
  • Dragomir Radev;Weiguo Fan;Hong Qi;Harris Wu;Amardeep Grewal

  • Affiliations:
  • University of Michigan, Ann Arbor, MI 48109;Virginia Polytechnic Institute and State University, Blacksburg, VA 24061;University of Michigan, Ann Arbor, MI 48109;University of Michigan, Ann Arbor, MI 48109;University of Michigan, Ann Arbor, MI 48109

  • Venue:
  • Journal of the American Society for Information Science and Technology
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web-based search engines such as Google and NorthernLight return documents that are relevant to a user query, not answers to user questions. We have developed an architecture that augments existing search engines so that they support natural language question answering. The process entails five steps: query modulation, document retrieval, passage extraction, phrase extraction, and answer ranking. In this article, we describe some probabilistic approaches to the last three of these stages. We show how our techniques apply to a number of existing search engines, and we also present results contrasting three different methods for question answering. Our algorithm, probabilistic phrase reranking (PPR), uses proximity and question type features and achieves a total reciprocal document rank of .20 on the TREC8 corpus. Our techniques have been implemented as a Web-accessible system, called NSIR. © 2005 Wiley Periodicals, Inc.