Scaling question answering to the web

Authors:
Cody Kwok;Oren Etzioni;Daniel S. Weld
Affiliations:
University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA
Venue:
ACM Transactions on Information Systems (TOIS)
Year:
2001

Citing 13
Cited 49

MURAX: a robust linguistic approach for question answering using an on-line encyclopedia

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Query expansion using lexical-semantic relations

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Auto-FAQ: an experiment in cyberspace leveraging

Computer Networks and ISDN Systems
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Grouper: a dynamic clustering interface to Web search results

WWW '99 Proceedings of the eighth international conference on World Wide Web
Focused crawling: a new approach to topic-specific Web resource discovery

WWW '99 Proceedings of the eighth international conference on World Wide Web
Building a question answering test collection

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A Maximum-Entropy-Inspired Parser

A Maximum-Entropy-Inspired Parser
Question Answering from Frequently Asked Question Files: Experiences with the FAQ Finder System

Question Answering from Frequently Asked Question Files: Experiences with the FAQ Finder System
Building a large annotated corpus of English: the penn treebank

Computational Linguistics - Special issue on using large corpora: II
Nymble: a high-performance learning name-finder

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
A new statistical parser based on bigram lexical dependencies

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Experiments with open-domain textual Question Answering

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1

KGCL: A Knowledge-Grid-Based Cooperative Learning Environment

ICWL '02 Proceedings of the First International Conference on Advances in Web-Based Learning
Meta-knowledge Annotation for Efficient Natural-Language Question-Answering

AICS '02 Proceedings of the 13th Irish International Conference on Artificial Intelligence and Cognitive Science
Searching the web: operator assistance required

Information Processing and Management: an International Journal
Domain-specific FAQ retrieval using independent aspects

ACM Transactions on Asian Language Information Processing (TALIP)
Unsupervised named-entity extraction from the web: an experimental study

Artificial Intelligence
Automatic question answering using the web: Beyond the Factoid

Information Retrieval
Semantic Segment Extraction and Matching for Internet FAQ Retrieval

IEEE Transactions on Knowledge and Data Engineering
Is it correct?: towards web-based evaluation of automatic natural language phrase generation

COLING-ACL '06 Proceedings of the COLING/ACL on Interactive presentation sessions
An exploration of the principles underlying redundancy-based factoid question answering

ACM Transactions on Information Systems (TOIS)
SERGEANT: A framework for building more flexible web agents by exploiting a search engine

Web Intelligence and Agent Systems
Autonomously semantifying wikipedia

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Lightweight web-based fact repositories for textual question answering

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Towards temporal web search

Proceedings of the 2008 ACM symposium on Applied computing
Beyond keywords: Automated question answering on the web

Communications of the ACM - Enterprise information integration: and other tools for merging data
Information extraction from Wikipedia: moving down the long tail

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Lexical and Semantic Resources for NLP: From Words to Meanings

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part III
Towards intelligent search assistance for inquiry-based learning

EdAppsNLP 05 Proceedings of the second workshop on Building Educational Applications Using NLP
WebCrow: a WEB-based system for crossword solving

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Searching for common sense: populating Cyc™ from the web

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Intelligence in wikipedia

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
An adaptive context-based algorithm for term weighting: application to single-word question answering

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Unsupervised named-entity extraction from the Web: An experimental study

Artificial Intelligence
Adapting a semantic question answering system to the web

MLQA '06 Proceedings of the Workshop on Multilingual Question Answering
Bringing why-QA to web search

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Search in the lost sense of "query": question formulation in web search queries and its temporal changes

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Knowledge and reasoning for question answering: Research perspectives

Information Processing and Management: an International Journal
Question classification for a Croatian QA system

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Focusing on novelty: a crawling strategy to build diverse language models

Proceedings of the 20th ACM international conference on Information and knowledge management
Hot-spot passage retrieval in question answering

ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
Solving italian crosswords using the web

AI*IA'05 Proceedings of the 9th conference on Advances in Artificial Intelligence
Cracking crosswords: the computer challenge

Reasoning, Action and Interaction in AI Theories and Systems
iQA: an intelligent question answering system

ICADL'05 Proceedings of the 8th international conference on Asian Digital Libraries: implementing strategies and sharing experiences
An approach to answer selection in question-answering based on semantic relations

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Semi-automatically extracting FAQs to improve accessibility of software development knowledge

Proceedings of the 34th International Conference on Software Engineering
Automatic identification of best answers in online enquiry communities

ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
Efficient indexing and querying over syntactically annotated trees

Proceedings of the VLDB Endowment
Crowdsourced comprehension: predicting prerequisite structure in Wikipedia

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Pattern learning for relation extraction with a hierarchical topic model

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
A high-performance FAQ retrieval method using minimal differentiator expressions

Knowledge-Based Systems
Predicting website correctness from consensus analysis

Proceedings of the 2012 ACM Research in Applied Computation Symposium
Words context analysis for improvement of information retrieval

ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part I
Numeric Query Answering on the Web

International Journal on Semantic Web & Information Systems
Transfer joint embedding for cross-domain named entity recognition

ACM Transactions on Information Systems (TOIS)
Autonomously reviewing and validating the knowledge base of a never-ending learning system

Proceedings of the 22nd international conference on World Wide Web companion
A cloud of FAQ: A highly-precise FAQ retrieval system for the Web 2.0

Knowledge-Based Systems
Discovering meaning on the go in large heterogenous data

Artificial Intelligence Review
Complex Terminology Extraction Model from Unstructured Web Text Based Linguistic and Statistical Knowledge

International Journal of Information Retrieval Research
Effects of Terms Recognition Mistakes on Requests Processing for Interactive Information Retrieval

International Journal of Information Retrieval Research
Discovering and Characterizing Places of Interest Using Flickr and Twitter

International Journal on Semantic Web & Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The wealth of information on the web makes it an attractive resource for seeking quick answers to simple, factual questions such as "e;who was the first American in space?"e; or "e;what is the second tallest mountain in the world?"e; Yet today's most advanced web search services (e.g., Google and AskJeeves) make it surprisingly tedious to locate answers to such questions. In this paper, we extend question-answering techniques, first studied in the information retrieval literature, to the web and experimentally evaluate their performance.First we introduce Mulder, which we believe to be the first general-purpose, fully-automated question-answering system available on the web. Second, we describe Mulder's architecture, which relies on multiple search-engine queries, natural-language parsing, and a novel voting procedure to yield reliable answers coupled with high recall. Finally, we compare Mulder's performance to that of Google and AskJeeves on questions drawn from the TREC-8 question answering track. We find that Mulder's recall is more than a factor of three higher than that of AskJeeves. In addition, we find that Google requires 6.6 times as much user effort to achieve the same level of recall as Mulder.