Semantic role labeling for open information extraction

Authors:
Janara Christensen; Mausam;Stephen Soderland;Oren Etzioni
Affiliations:
University of Washington, Seattle;University of Washington, Seattle;University of Washington, Seattle;University of Washington, Seattle
Venue:
FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Year:
2010

Citing 17
Cited 6

Learning Information Extraction Rules for Semi-Structured and Free Text

Machine Learning - Special issue on natural language learning
Snowball: extracting relations from large plain-text collections

DL '00 Proceedings of the fifth ACM conference on Digital libraries
The Berkeley FrameNet Project

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Preemptive information extraction using unrestricted relation discovery

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
A global joint model for semantic role labeling

Computational Linguistics
Tree kernels for semantic role labeling

Computational Linguistics
The importance of syntactic parsing and inference in semantic role labeling

Computational Linguistics
Using Wikipedia to bootstrap open information extraction

ACM SIGMOD Record
Machine reading

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
The effect of syntactic representation on semantic role labeling

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Shallow semantic parsing for spoken language understanding

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Coupling semi-supervised learning of categories and relations

SemiSupLearn '09 Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing
Open information extraction from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Open knowledge extraction through compositional language processing

STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
A probabilistic model of redundancy in information extraction

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Unsupervised semantic parsing

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Open information extraction for the web

Open information extraction for the web

Machine reading at the University of Washington

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Relation guided bootstrapping of semantic lexicons

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Identifying relations for open information extraction

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Efficient indexing and querying over syntactically annotated trees

Proceedings of the VLDB Endowment
ClausIE: clause-based open information extraction

Proceedings of the 22nd international conference on World Wide Web
Mining semantics for culturomics: towards a knowledge-based approach

Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Open Information Extraction is a recent paradigm for machine reading from arbitrary text. In contrast to existing techniques, which have used only shallow syntactic features, we investigate the use of semantic features (semantic roles) for the task of Open IE. We compare TextRunner (Banko et al., 2007), a state of the art open extractor, with our novel extractor SRL-IE, which is based on UIUC's SRL system (Punyakanok et al., 2008). We find that SRL-IE is robust to noisy heterogeneous Web data and outperforms TextRunner on extraction quality. On the other hand, TextRunner performs over 2 orders of magnitude faster and achieves good precision in high locality and high redundancy extractions. These observations enable the construction of hybrid extractors that output higher quality results than TextRunner and similar quality as SRL-IE in much less time.