Mining the web for answers to natural language questions

Authors:
Dragomir R. Radev;Hong Qi;Zhiping Zheng;Sasha Blair-Goldensohn;Zhu Zhang;Weiguo Fan;John Prager
Affiliations:
University of Michigan, Ann Arbor, MI;University of Michigan, Ann Arbor, MI;University of Michigan, Ann Arbor, MI;University of Michigan, Ann Arbor, MI;University of Michigan, Ann Arbor, MI;University of Michigan, Ann Arbor, MI;IBM TJ Watson Research Center, Hawthorne, NY
Venue:
Proceedings of the tenth international conference on Information and knowledge management
Year:
2001

Citing 16
Cited 35

A statistical approach to machine translation

Computational Linguistics
Statistical methods for speech recognition

Statistical methods for speech recognition
Improving automatic query expansion

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Information retrieval as statistical translation

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Question-answering by predictive annotation

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Web Search---Your Way

Communications of the ACM
Statistics-Based Summarization - Step One: Sentence Compression

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
Machine transliteration

Computational Linguistics
Ranking suspected answers to natural language questions using predictive annotation

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
A stochastic parts program and noun phrase parser for unrestricted text

ANLC '88 Proceedings of the second conference on Applied natural language processing
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Headline generation based on statistical translation

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
The Candide system for machine translation

HLT '94 Proceedings of the workshop on Human Language Technology

Getting answers to natural language questions on the web

Journal of the American Society for Information Science and Technology
Probabilistic question answering on the web

Proceedings of the 11th international conference on World Wide Web
On the MSE robustness of batching estimators

Proceedings of the 33nd conference on Winter simulation
Web-scale information extraction in knowitall: (preliminary results)

Proceedings of the 13th international conference on World Wide Web
Is question answering an acquired skill?

Proceedings of the 13th international conference on World Wide Web
Learning to find answers to questions on the Web

ACM Transactions on Internet Technology (TOIT)
FADA: find all distinct answers

Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Learning by googling

ACM SIGKDD Explorations Newsletter
Probabilistic question answering on the Web: Research Articles

Journal of the American Society for Information Science and Technology
Sampling search-engine results

WWW '05 Proceedings of the 14th international conference on World Wide Web
Domain-specific FAQ retrieval using independent aspects

ACM Transactions on Asian Language Information Processing (TALIP)
Is it the right answer?: exploiting web redundancy for Answer Validation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Synonymous collocation extraction using translation information

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
An analysis of the AskMSR question-answering system

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Optimizing synonym extraction using monolingual and bilingual resources

PARAPHRASE '03 Proceedings of the second international workshop on Paraphrasing - Volume 16
Automatic question answering using the web: Beyond the Factoid

Information Retrieval
Generating page clippings from web search results using a dynamically terminated genetic algorithm

Information Systems
Open-domain question: answering

Foundations and Trends in Information Retrieval
Semantic verification in an online fact seeking environment

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Improving the performance of question answering with semantically equivalent answer patterns

Data & Knowledge Engineering
Lexical and Semantic Resources for NLP: From Words to Meanings

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part III
Exploring models for semantic category verification

Information Systems
Exploring models for semantic category verification

Information Systems
Honto? search: estimating trustworthiness of web information by search results aggregation and temporal analysis

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Intelligent answering location questions from the web using molecular alignment

Journal of Intelligent Information Systems
'How may I help you'-spoken queries for technical assistance

Proceedings of the 48th Annual Southeast Regional Conference
Versatile question answering systems: seeing in synthesis

International Journal of Intelligent Information and Database Systems
Searching the world wide web for local services and facilities: a review on the patterns of location-based queries

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
Automatic acquisition of semantic-based question reformulations for question answering

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Machine learning for query formulation in question answering

Natural Language Engineering
Web-based unsupervised learning for query formulation in question answering

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Using semantic constraints to improve question answering

NLDB'06 Proceedings of the 11th international conference on Applications of Natural Language to Information Systems
A case study of using web search statistics: case restoration

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Web-Based multiple choice question answering for english and arabic questions

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
A KE-LSA approach for user-centered design

Journal of Intelligent Manufacturing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The web is now becoming one of the largest information and knowledge repositories. Many large scale search engines (Google, Fast, Northern Light, etc.) have emerged to help users find information. In this paper, we study how we can effectively use these existing search engines to mine the Web and discover the "correct" answers to factual natural language questions.We propose a probabilistic algorithm called QASM (Question Answering using Statistical Models) that learns the best query paraphrase of a natural language question. We validate our approach for both local and web search engines using questions from the TREC evaluation. We also show how this algorithm can be combined with another algorithm (AnSel) to produce precise answers to natural language questions.