Learning to find answers to questions on the Web

Authors:
Eugene Agichtein;Steve Lawrence;Luis Gravano
Affiliations:
Columbia University;NEC Research Institute;Columbia University
Venue:
ACM Transactions on Internet Technology (TOIT)
Year:
2004

Citing 27
Cited 16

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
On term selection for query expansion

Journal of Documentation
WordNet: a lexical database for English

Communications of the ACM
On relevance weights with little relevance information

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Improving automatic query expansion

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Results and challenges in Web search evaluation

WWW '99 Proceedings of the eighth international conference on World Wide Web
Indexing and retrieval of scientific literature

Proceedings of the eighth international conference on Information and knowledge management
Improving the effectiveness of information retrieval with local context analysis

ACM Transactions on Information Systems (TOIS)
Bridging the lexical chasm: statistical approaches to answer-finding

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Elicitation queries to the excite Web search engine

Proceedings of the ninth international conference on Information and knowledge management
Retrieving descriptive phrases from large amounts of free text

Proceedings of the ninth international conference on Information and knowledge management
Scaling question answering to the Web

Proceedings of the 10th international conference on World Wide Web
Learning search engine specific query transformations for question answering

Proceedings of the 10th international conference on World Wide Web
Mining the web for answers to natural language questions

Proceedings of the tenth international conference on Information and knowledge management
Probabilistic question answering on the web

Proceedings of the 11th international conference on World Wide Web
Context and Page Analysis for Improved Web Search

IEEE Internet Computing
A Real World Implementation of Answer Extraction

DEXA '98 Proceedings of the 9th International Workshop on Database and Expert Systems Applications
Improving Category Specific Web Search by Learning Query Modifications

SAINT '01 Proceedings of the 2001 Symposium on Applications and the Internet (SAINT 2001)
Examining the role of statistical and linguistic knowledge sources in a general-knowledge question-answering system

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Answer extraction

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
A simple rule-based part of speech tagger

ANLC '92 Proceedings of the third conference on Applied natural language processing
Role of verbs in document analysis

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Experiments with open-domain textual Question Answering

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Experiments in automated lexicon building for text searching

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Learning how to answer questions using trivia games

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Statistical answer-type identification in open-domain question answering

HLT '02 Proceedings of the second international conference on Human Language Technology Research

The infocious web search engine: improving web searching through linguistic analysis

WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Retrieving answers from frequently asked questions pages on the web

Proceedings of the 14th ACM international conference on Information and knowledge management
Automatic question answering using the web: Beyond the Factoid

Information Retrieval
An exploration of the principles underlying redundancy-based factoid question answering

ACM Transactions on Information Systems (TOIS)
Quality-aware collaborative question answering: methods and evaluation

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Automatic keyword prediction using Google similarity distance

Expert Systems with Applications: An International Journal
Model tree learning for query term weighting in question answering

ECIR'07 Proceedings of the 29th European conference on IR research
Word AdHoc Network: Using Google Core Distance to extract the most relevant information

Knowledge-Based Systems
Using Google latent semantic distance to extract the most relevant information

Expert Systems with Applications: An International Journal
Addressing people's information needs directly in a web search result page

Proceedings of the 20th international conference on World wide web
How to extract Arabic definitions from the web? Arabic definition question answering system

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Human-machine design considerations in advanced machine-learning systems

IBM Journal of Research and Development
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
Machine learning for query formulation in question answering

Natural Language Engineering
Web-based unsupervised learning for query formulation in question answering

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Direct answers for search queries in the long tail

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce a method for learning to find documents on the Web that contain answers to a given natural language question. In our approach, questions are transformed into new queries aimed at maximizing the probability of retrieving answers from existing information retrieval systems. The method involves automatically learning phrase features for classifying questions into different types, automatically generating candidate query transformations from a training set of question/answer pairs, and automatically evaluating the candidate transformations on target information retrieval systems such as real-world general purpose search engines. At run-time, questions are transformed into a set of queries, and reranking is performed on the documents retrieved. We present a prototype search engine, Tritus, that applies the method to Web search engines. Blind evaluation on a set of real queries from a Web search engine log shows that the method significantly outperforms the underlying search engines, and outperforms a commercial search engine specializing in question answering. Our methodology cleanly supports combining documents retrieved from different search engines, resulting in additional improvement with a system that combines search results from multiple Web search engines.