Using Finite State Technology in Natural Language Processing of Basque

  • Authors:
  • Iñaki Alegria;Maxux Aranzabe;Nerea Ezeiza;Aitzol Ezeiza;Ruben Urizar

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • CIAA '01 Revised Papers from the 6th International Conference on Implementation and Application of Automata
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes the components used in the design and implementation of NLP tools for Basque. These components are based on finite state technology and are devoted to the morphological analysis of Basque, an agglutinative pre-Indo-European language. We think that our design can be interesting for the treatment of other languages. The main components developed are a general and robust morphological analyser/generator and a spelling checker/corrector for Basque named Xuxen. The analyser is a basic tool for current and future work on NLP of Basque, such as the lemmatiser/tagger Euslem, an Intranet search engine or an assistant for verse-making.