Iterative part-of-speech tagging

  • Authors:
  • Alípio Jorge;Alneu de Andrade Lopes

  • Affiliations:
  • Univ. do Porto, Porto, Portugal;Univ. do Porto, Porto, Portugal

  • Venue:
  • Learning language in logic
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Assigning a category to a given word (tagging) depends on the particular word and on the categories (tags) of neighboring words. A theory that is able to assign tags to a given text can naturally be viewed as a recursive logic program. This article describes how iterative induction, a technique that has been proven powerful in the synthesis of recursive logic programs, has been applied to the task of part-of-speech tagging. The main strategy consists of inducing a succession T1, T2,..., Tn of theories, using in the induction of theory Ti all the previously induced theories. Each theory in the sequence may have lexical rules, context rules and hybrid ones. This iterative strategy is, to a large extent, independent of the inductive algorithm underneath. Here we consider one particular relational learning algorithm, CSC(RC), and we induce first order theories from positive examples and background knowledge that are able to successfully tag a relatively large corpus in Portuguese.