Dependency-based open information extraction

  • Authors:
  • Pablo Gamallo;Marcos Garcia;Santiago Fernández-Lanza

  • Affiliations:
  • Universidade de Santiago de Compostela;Universidade de Santiago de Compostela;Universidade de Vigo

  • Venue:
  • ROBUS-UNSUP '12 Proceedings of the Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Building shallow semantic representations from text corpora is the first step to perform more complex tasks such as text entailment, enrichment of knowledge bases, or question answering. Open Information Extraction (OIE) is a recent unsupervised strategy to extract billions of basic assertions from massive corpora, which can be considered as being a shallow semantic representation of those corpora. In this paper, we propose a new multilingual OIE system based on robust and fast rule-based dependency parsing. It permits to extract more precise assertions (verb-based triples) from text than state of the art OIE systems, keeping a crucial property of those systems: scaling to Web-size document collections.