Scaling textual inference to the web

Authors:
Stefan Schoenmackers;Oren Etzioni;Daniel S. Weld
Affiliations:
University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA
Venue:
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Year:
2008



Abstract

Most Web-based Q/A systems work by finding pages that contain an explicit answer to a question. These systems are helpless if the answer has to be inferred from multiple sentences, possibly on different pages. To solve this problem, we introduce the Holmes system, which performs textual inference (TI) over tuples extracted from text. Whereas previous work on TI (e.g., the literature on textual entailment) has been applied to paragraph-sized texts, Holmes uses knowledge-based model construction to scale TI to a corpus of 117 million Web pages. Given only a few minutes, Holmes doubles recall for example queries in three disparate domains (geography, business, and nutrition). Importantly, Holmes's runtime is linear in the size of its input corpus due to a surprising property of many textual relations in the Web corpus: they are "approximately" functional in a well-defined sense.
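
The idea of inferring an answer from tuples found on different pages can be illustrated with a minimal Python sketch. This is not the Holmes implementation: the tuples, the page identifiers, the rule `helps_prevent(X, Z) :- is_high_in(X, Y), prevents(Y, Z)`, and the function names are all hypothetical, and the sketch performs a plain deterministic join rather than the probabilistic, knowledge-based model construction the abstract describes. The `max_fanout` helper is likewise only an assumed, informal proxy for "approximately functional"; the abstract does not state the paper's definition.

```python
from collections import defaultdict

# Hypothetical extracted tuples: (relation, arg1, arg2, source page).
TUPLES = [
    ("is_high_in", "kale", "calcium", "page-a"),
    ("prevents", "calcium", "osteoporosis", "page-b"),
    ("is_high_in", "spinach", "iron", "page-c"),
    ("prevents", "iron", "anemia", "page-d"),
]


def build_index(tuples):
    """Index tuples by (relation, arg1) so each join step is a dictionary lookup."""
    index = defaultdict(list)
    for rel, arg1, arg2, src in tuples:
        index[(rel, arg1)].append((arg2, src))
    return index


def infer_helps_prevent(tuples):
    """Apply the illustrative rule
    helps_prevent(X, Z) :- is_high_in(X, Y), prevents(Y, Z)
    and record which pages supported each derived tuple."""
    index = build_index(tuples)
    derived = []
    for rel, x, y, src1 in tuples:
        if rel != "is_high_in":
            continue
        # Join on the shared variable Y; the two facts may come from different pages.
        for z, src2 in index.get(("prevents", y), []):
            derived.append(("helps_prevent", x, z, (src1, src2)))
    return derived


def max_fanout(tuples, relation):
    """Largest number of distinct arg2 values seen for any single arg1.
    Assumed proxy only: a small value means each join step adds few new tuples."""
    values = defaultdict(set)
    for rel, arg1, arg2, _ in tuples:
        if rel == relation:
            values[arg1].add(arg2)
    return max((len(v) for v in values.values()), default=0)


if __name__ == "__main__":
    for fact in infer_helps_prevent(TUPLES):
        print(fact)
    print("max fan-out of 'prevents':", max_fanout(TUPLES, "prevents"))
```

In this toy data no single page states that kale helps prevent osteoporosis, yet the join derives it from two pages. When a relation's fan-out stays small, each lookup returns only a handful of tuples, which gives an intuition for how near-functional relations could keep inference cost proportional to corpus size.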