Mining Answers in German Web Pages

Authors:
Günter Neumann;Feiyu Xu
Affiliations:
-;-
Venue:
WI '03 Proceedings of the 2003 IEEE/WIC International Conference on Web Intelligence
Year:
2003

Citing 0
Cited 3

Query selection for improved Greek web searches

Proceedings of the 2nd ACM workshop on Improving non english web searching
Adapting a semantic question answering system to the web

MLQA '06 Proceedings of the Workshop on Multilingual Question Answering
Question answering using sentence parsing and semantic network matching

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a novel method for mining textual answers in German Web pages using semi-structured NL questions and Google for initial document retrieval. We exploit the redundancy on the Web by weighting all identified named entities (NEs) found in the relevantdocument set based on their occurrences and distributions. The ranked NEs are used as our primary anchors for document indexing, paragraph selection, and answer identification. The latter is dependent on two factors: the overlap of terms at different levels (e.g., tokens and named entities) between queries and sentences, and the relevance of identified NEs corresponding to the expected answer type. The set of answer candidates is further subdivided into ranked equivalent classes from which the final answer is selected. The system has been evaluated using question-answer pairs extracted from a popular German quiz book.