MURAX: a robust linguistic approach for question answering using an on-line encyclopedia
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Bridging the lexical chasm: statistical approaches to answer-finding
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
On the MSE robustness of batching estimators
Proceedings of the 33nd conference on Winter simulation
A noisy-channel approach to question answering
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
A Statistical Classification Approach to Question Answering using Web Data
CW '05 Proceedings of the 2005 International Conference on Cyberworlds
An analysis of the AskMSR question-answering system
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Statistical QA - classifier vs. re-ranker: what's the difference?
MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
Open-domain question: answering
Foundations and Trends in Information Retrieval
Baseball: an automatic question-answerer
IRE-AIEE-ACM '61 (Western) Papers presented at the May 9-11, 1961, western joint IRE-AIEE-ACM computer conference
LKR'08 Proceedings of the 3rd international conference on Large-scale knowledge resources: construction and application
Optimizing question answering accuracy by maximizing log-likelihood
ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Full machine translation for factoid question answering
EACL 2012 Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra)
Hi-index | 0.00 |
In this paper we present a statistical approach to question answering (QA). Our motivation is to build robust systems for many languages without the need for highly tuned linguistic modules. Consequently, word tokens and web data are used extensively but neither explicit linguistic knowledge nor annotated data is incorporated. A mathematical model for answer retrieval and answer classification is derived. Experiments are conducted by searching for answers in the AQUAINT corpus, as well as in web data. The redundancy inherent in web data outperforms retrieval from a fixed corpus, where there are typically relatively few answer occurrences for any given question. We participated with an implementation of this framework in the TREC 2006 QA evaluations, where we ranked 9th among 27 participants on the factoid task.