Semantic relation extraction from legislative text using generalized syntactic dependencies and support vector machines

  • Authors:
  • Guido Boella;Luigi Di Caro;Livio Robaldo

  • Affiliations:
  • Department of Computer Science, University of Turin, Turin, Italy;Department of Computer Science, University of Turin, Turin, Italy;Department of Computer Science, University of Turin, Turin, Italy

  • Venue:
  • RuleML'13 Proceedings of the 7th international conference on Theory, Practice, and Applications of Rules on the Web
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a technique to automatically extract semantic knowledge from legislative text. Instead of using pattern matching methods relying on lexico-syntactic patterns, we propose a technique which uses syntactic dependencies between terms extracted with a syntactic parser. The idea is that syntactic information are more robust than pattern matching approaches when facing length and complexity of the sentences. Relying on a manually annotated legislative corpus, we transform all the surrounding syntax of the semantic information into abstract textual representations, which are then used to create a classification model by means of a standard Support Vector Machine system. In this work, we initially focus on three different semantic tags, achieving very high accuracy levels on two of them, demonstrating both the limits and the validity of the approach.