Automatic question answering using the web: Beyond the Factoid

Authors:
Radu Soricut;Eric Brill
Affiliations:
Information Sciences Institute, University of Southern California, Marina del Rey, USA 90292;Microsoft Research, Redmond, USA 98052
Venue:
Information Retrieval
Year:
2006


Abstract

In this paper we describe and evaluate a Question Answering (QA) system that goes beyond answering factoid questions. Our approach to QA assumes no restrictions on the type of questions that are handled, and no assumption that the answers to be provided are factoids. We present an unsupervised approach for collecting question and answer pairs from FAQ pages, which we use to collect a corpus of 1 million question/answer pairs from FAQ pages available on the Web. This corpus is used to train various statistical models employed by our QA system: a statistical chunker used to transform a natural language-posed question into a phrase-based query to be submitted for exact match to an off-the-shelf search engine; an answer/question translation model, used to assess the likelihood that a proposed answer is indeed an answer to the posed question; and an answer language model, used to assess the likelihood that a proposed answer is a well-formed answer. We evaluate our QA system in a modular fashion, by comparing the performance of baseline algorithms against our proposed algorithms for various modules in our QA system. The evaluation shows that our system achieves reasonable performance in terms of answer accuracy for a large variety of complex, non-factoid questions.
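
As a rough illustration of how an answer/question translation model and an answer language model could be combined to rank candidate answers, here is a minimal sketch. The abstract does not give the model forms, so everything here is an assumption made for illustration: the IBM Model 1 style translation score, the unigram language model, the length normalization, the `lm_weight` parameter, the probability floor, and all function names and toy probabilities.

```python
import math


def translation_log_prob(question, answer, t_table, floor=1e-6):
    """IBM Model 1 style log p(question | answer) (assumed model form).

    t_table maps (question_word, answer_word) -> probability; pairs that
    are missing fall back to a small floor value.
    """
    answer_tokens = answer + ["<null>"]  # hypothetical null word, as in Model 1
    log_p = 0.0
    for q_word in question:
        total = sum(t_table.get((q_word, a_word), floor) for a_word in answer_tokens)
        log_p += math.log(total / len(answer_tokens))
    return log_p


def lm_log_prob(answer, unigram, floor=1e-6):
    """Unigram log p(answer); a stand-in for a real answer language model."""
    return sum(math.log(unigram.get(word, floor)) for word in answer)


def rank_answers(question, candidates, t_table, unigram, lm_weight=0.5):
    """Sort candidate answers by translation score plus weighted LM score."""
    scored = []
    for candidate in candidates:
        tokens = candidate.lower().split()
        # Per-token average so long answers are not penalized for length alone.
        lm = lm_log_prob(tokens, unigram) / max(len(tokens), 1)
        tm = translation_log_prob(question, tokens, t_table)
        scored.append((tm + lm_weight * lm, candidate))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [candidate for _, candidate in scored]


if __name__ == "__main__":
    # Toy probabilities, invented for this example only.
    t_table = {
        ("why", "because"): 0.5,
        ("sky", "atmosphere"): 0.4,
        ("blue", "scattering"): 0.3,
    }
    unigram = {"because": 0.05, "the": 0.1, "atmosphere": 0.01}
    question = ["why", "sky", "blue"]
    candidates = [
        "bananas are yellow",
        "because the atmosphere causes scattering",
    ]
    for answer in rank_answers(question, candidates, t_table, unigram):
        print(answer)
```

In a trained system, the translation table and the language model would be estimated from the collected question/answer pairs, and the candidate answers would come from the documents returned by the search engine.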