PHORA: A NLP System for Spanish

  • Authors:
  • Manuel Palomar;Maximiliano Saiz-Noeda;Rafael Muñoz;Armando Suárez;Patricio Martínez-Barco;Andrés Montoyo

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • CICLing '01 Proceedings of the Second International Conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2001

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we present a whole Natural Language Processing (NLP) system for Spanish. The core of this system is the parser, which uses the grammatical formalism Lexical-Functional Grammars (LFG). The system uses the Specification Marks Method in order to resolve the lexical ambiguity. Another important component of this system is the anaphora resolution module. To solve the anaphora, this module contains a method based on linguistic information (lexical, morphological, syntactic and semantic), structural information (anaphoric accessibility space in which the anaphor obtains the antecedent) and statistical information. This method is based on constraints and preferences and solves pronouns and definite descriptions. Moreover, this system fits dialogue and non-dialogue discourse features. The anaphora resolution module uses several resources, such as a lexical database (Spanish WordNet) to provide semantic information and a POS tagger providing the part of speech for each word and its root to make this resolution process easier.