Mining natural language answers from the web

Authors:
Günter Neumann;Feiyu Xu
Affiliations:
Language Technology Lab, DFKI, D-66123 Saarbrücken, Germany;Language Technology Lab, DFKI, D-66123 Saarbrücken, Germany
Venue:
Web Intelligence and Agent Systems
Year:
2004

Citing 15
Cited 4

Building a question answering test collection

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Scaling question answering to the Web

Proceedings of the 10th international conference on World Wide Web
Exploiting redundancy in question answering

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic question answering on the web

Proceedings of the 11th international conference on World Wide Web
Towards a theory of natural language interfaces to databases

Proceedings of the 8th international conference on Intelligent user interfaces
Bootstrapping an ontology-based information extraction system

Intelligent exploration of the web
Analyses for elucidating current question answering technology

Natural Language Engineering
Answer extraction

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Nymble: a high-performance learning name-finder

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
The NYU system for MUC-6 or where's the syntax?

MUC6 '95 Proceedings of the 6th conference on Message understanding
Learning surface text patterns for a Question Answering system

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Looking under the hood: tools for diagnosing your question answering engine

ODQA '01 Proceedings of the workshop on Open-domain question answering - Volume 12
Language independent NER using a unified model of internal and contextual evidence

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Unsupervised personal name disambiguation

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
AnswerBus question answering system

HLT '02 Proceedings of the second international conference on Human Language Technology Research

VisiQ: Supporting visual and interactive query refinement

Web Intelligence and Agent Systems
Wrapping VRXQuery with self-adaptive fuzzy capabilities

Web Intelligence and Agent Systems
Language independent answer prediction from the web

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Towards improving the online shopping experience: A client-based platform for post-processing Web search results

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a novel method for mining textual answers in Web pages using semi-structured NL questions and Google for initial document retrieval. We exploit the redundancy on the Web by weighting all identified named entities (NEs) found in the relevant document set based on their occurrences and distributions. The ranked NEs are used as our primary anchors for document indexing, paragraph selection, and answer identification. The latter is dependent on two factors: the overlap of terms at different levels (e.g., tokens and named entities) between queries and sentences, and the relevance of identified NEs corresponding to the expected answer type. The set of answer candidates is further subdivided into ranked equivalent classes from which the final answer is selected. The system has been evaluated using question-answer pairs extracted from a popular German quiz book.