Semantic role labeling for open information extraction

  • Authors:
  • Janara Christensen; Mausam;Stephen Soderland;Oren Etzioni

  • Affiliations:
  • University of Washington, Seattle;University of Washington, Seattle;University of Washington, Seattle;University of Washington, Seattle

  • Venue:
  • FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Open Information Extraction is a recent paradigm for machine reading from arbitrary text. In contrast to existing techniques, which have used only shallow syntactic features, we investigate the use of semantic features (semantic roles) for the task of Open IE. We compare TextRunner (Banko et al., 2007), a state of the art open extractor, with our novel extractor SRL-IE, which is based on UIUC's SRL system (Punyakanok et al., 2008). We find that SRL-IE is robust to noisy heterogeneous Web data and outperforms TextRunner on extraction quality. On the other hand, TextRunner performs over 2 orders of magnitude faster and achieves good precision in high locality and high redundancy extractions. These observations enable the construction of hybrid extractors that output higher quality results than TextRunner and similar quality as SRL-IE in much less time.