A set of NP-Extraction rules for Portuguese: defining, learning and pruning

Authors:
Claudia Oliveira;Maria Claudia Freitas;Violeta Quental;Cícero Nogueira dos Santos;Renato Paes Leme;Lucas Souza
Affiliations:
Departamento de Engenharia de Sistemas, Instituto Militar de Engenharia, Rio de Janeiro, Brazil;Departamento de Letras, Pontifícia Universidade Católica, Rio de Janeiro, Brazil;Departamento de Letras, Pontifícia Universidade Católica, Rio de Janeiro, Brazil;Departamento de Informática, Pontifícia Universidade Católica, Rio de Janeiro, Brazil;Departamento de Engenharia de Sistemas, Instituto Militar de Engenharia, Rio de Janeiro, Brazil;Departamento de Letras, Pontifícia Universidade Católica, Rio de Janeiro, Brazil
Venue:
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Year:
2006

References (5):
Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Computational Linguistics.
The role of lexicalization and pruning for base noun phrase grammars. AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference.
Error-driven pruning of Treebank grammars for base noun phrase identification. COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1.
Noun-phrase analysis in unrestricted text for information retrieval. ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics.
Man vs. machine: a case study in base noun phrase learning. ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics.

Cited by (1):
Readability assessment for text simplification. IUNLPBEA '10 Proceedings of the NAACL HLT 2010 Fifth Workshop on Innovative Use of NLP for Building Educational Applications.

Abstract

This paper presents a set of rules for extracting noun phrases from Portuguese texts. We describe how this set was gradually obtained, starting from a machine-learned set of transformation rules that was manually reviewed. The noun phrases extracted by these transformations were given as input to another learner that synthesized rules for breaking up complex noun phrases into simpler ones. The results of these processes applied to a Brazilian Portuguese corpus are evaluated.
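
To make the idea of transformation rules for noun-phrase extraction concrete, the following is a minimal sketch of how ordered, context-sensitive rules might retag a POS-tagged sentence with chunk labels. It is an illustration only: the `Rule` format, the B/I/O chunk tags, the POS labels (`ART`, `N`, `V`, `ADJ`) and the three example rules are all assumptions made for this sketch, not the rules or the learner described in the paper.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Rule:
    """Hypothetical transformation: change from_tag to to_tag in a POS context."""
    from_tag: str
    to_tag: str
    pos: str                          # POS the current token must have
    prev_pos: Optional[str] = None    # POS the previous token must have, if set


def apply_rules(pos_tags: List[str], chunk_tags: List[str],
                rules: List[Rule]) -> List[str]:
    """Apply each rule in order, scanning the sentence left to right."""
    tags = list(chunk_tags)
    for rule in rules:
        for i, pos in enumerate(pos_tags):
            prev = pos_tags[i - 1] if i > 0 else None
            if (tags[i] == rule.from_tag and pos == rule.pos
                    and (rule.prev_pos is None or prev == rule.prev_pos)):
                tags[i] = rule.to_tag
    return tags


def extract_nps(tokens: List[str], tags: List[str]) -> List[str]:
    """Collect maximal B/I-tagged token runs as noun-phrase strings."""
    nps: List[str] = []
    current: List[str] = []
    for tok, tag in zip(tokens, tags):
        if tag == "B" or (tag == "I" and not current):
            if current:                      # a new B closes the open phrase
                nps.append(" ".join(current))
            current = [tok]
        elif tag == "I":
            current.append(tok)
        else:                                # O ends any open phrase
            if current:
                nps.append(" ".join(current))
            current = []
    if current:
        nps.append(" ".join(current))
    return nps


# Illustrative sentence and rules (assumed, not taken from the paper).
TOKENS = ["o", "menino", "viu", "a", "casa", "azul"]
POS = ["ART", "N", "V", "ART", "N", "ADJ"]
BASELINE = ["O"] * len(TOKENS)               # start with everything outside an NP
RULES = [
    Rule("O", "B", pos="ART"),                 # an article opens an NP
    Rule("O", "I", pos="N", prev_pos="ART"),   # a noun after an article continues it
    Rule("O", "I", pos="ADJ", prev_pos="N"),   # a postnominal adjective continues it
]

if __name__ == "__main__":
    tags = apply_rules(POS, BASELINE, RULES)
    print(tags)
    print(extract_nps(TOKENS, tags))
```

In this sketch the rule order matters, as in transformation-based learning generally: each rule sees the tags left by the rules before it. A learner would choose and order such rules by how much each one reduces errors against an annotated corpus; that step is omitted here.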