Open information extraction: the second generation

Authors:
Oren Etzioni;Anthony Fader;Janara Christensen;Stephen Soderland;Mausam
Affiliations:
Turing Center, Department of Computer Science and Engineering, University of Washington, Seattle, WA;Turing Center, Department of Computer Science and Engineering, University of Washington, Seattle, WA;Department of Computer Science and Engineering, University of Washington, Seattle, WA;Turing Center, Department of Computer Science and Engineering, University of Washington, Seattle, WA;Turing Center, Department of Computer Science and Engineering, University of Washington, Seattle, WA
Venue:
IJCAI'11 Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence - Volume One
Year:
2011

Abstract

How do we scale information extraction to the massive size and unprecedented heterogeneity of the Web corpus? Beginning in 2003, our KnowItAll project has sought to extract high-quality knowledge from the Web. In 2007, we introduced the Open Information Extraction (Open IE) paradigm, which eschews hand-labeled training examples and avoids domain-specific verbs and nouns in order to develop unlexicalized, domain-independent extractors that scale to the Web corpus. Open IE systems have extracted billions of assertions as the basis for both common-sense knowledge and novel question-answering systems. This paper describes the second generation of Open IE systems, which rely on a novel model of how relations and their arguments are expressed in English sentences to double precision/recall compared with previous systems such as TEXTRUNNER and WOE.
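
As a rough illustration of what "unlexicalized, domain-independent" means in the abstract, the toy sketch below extracts an (argument, relation, argument) triple by matching a pattern over part-of-speech tags only, never over specific words. It is not the method of the systems the paper describes. The tag inventory, the pattern, and the names TAG_CODE, PATTERN, and extract are all assumptions made for illustration, and the tags are supplied by hand rather than by a real tagger.

```python
import re
from typing import List, Optional, Tuple

# A sentence is a list of (token, coarse POS tag) pairs. Tags are hand-supplied
# here; a real system would obtain them from a tagger (assumption).
Tagged = List[Tuple[str, str]]

# One letter per coarse tag, so a sentence becomes a string a regex can match.
TAG_CODE = {"NOUN": "N", "PROPN": "N", "DET": "D", "ADJ": "J",
            "VERB": "V", "AUX": "V", "ADP": "P", "ADV": "R"}

# Hypothetical pattern over tags only (no specific verbs or nouns):
# noun phrase, verb group optionally ending in a preposition, noun phrase.
NP = r"D?J*N+"
PATTERN = re.compile(rf"(?P<arg1>{NP})(?P<rel>V+R?P?)(?P<arg2>{NP})")


def extract(tagged: Tagged) -> Optional[Tuple[str, str, str]]:
    """Return the first (arg1, relation, arg2) triple, or None if no match."""
    # Unknown tags map to "X", which the pattern never matches.
    codes = "".join(TAG_CODE.get(tag, "X") for _, tag in tagged)
    m = PATTERN.search(codes)
    if m is None:
        return None

    def words(group: str) -> str:
        # One code letter per token, so regex spans index tokens directly.
        start, end = m.span(group)
        return " ".join(tok for tok, _ in tagged[start:end])

    return words("arg1"), words("rel"), words("arg2")


if __name__ == "__main__":
    print(extract([("Edison", "PROPN"), ("invented", "VERB"),
                   ("the", "DET"), ("phonograph", "NOUN")]))
    print(extract([("The", "DET"), ("enzyme", "NOUN"), ("is", "AUX"),
                   ("produced", "VERB"), ("by", "ADP"), ("yeast", "NOUN")]))
```

Because the pattern refers only to tags, the same code handles a sentence about inventors and a sentence about biochemistry without any per-domain vocabulary or labeled training examples, which is the property the Open IE paradigm relies on.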