A set of NP-Extraction rules for portuguese: defining, learning and pruning

  • Authors:
  • Claudia Oliveira;Maria Claudia Freitas;Violeta Quental;Cícero Nogueira dos Santos;Renato Paes Leme;Lucas Souza

  • Affiliations:
  • Departamento de Engenharia de Sistemas, Instituto Militar de Engenharia, Rio de Janeiro, Brazil;Departamento de Letras, Pontifícia Universidade Católica, Rio de Janeiro, Brazil;Departamento de Letras, Pontifícia Universidade Católica, Rio de Janeiro, Brazil;Departamento de Informática, Pontifícia Universidade Católica, Rio de Janeiro, Brazil;Departamento de Engenharia de Sistemas, Instituto Militar de Engenharia, Rio de Janeiro, Brazil;Departamento de Letras, Pontifícia Universidade Católica, Rio de Janeiro, Brazil

  • Venue:
  • PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a set of rules for extracting noun phrases from Portuguese texts. We describe how this set was gradually obtained, starting from a machine learned set of transformation rules that was manually reviewed. The noun phrases extracted by these transformations were given as input to another learner that synthesized rules for breaking up complex noun phrases into simpler ones. The results of these processes applied to a Brazilian Portuguese corpus are evaluated.