Unsupervised learning of a rule-based Spanish Part of Speech tagger

  • Authors:
  • Chinatsu Aone;Kevin Hausman

  • Affiliations:
  • Systems Research and Applications Corporation (SRA), Fairfax, VA;Systems Research and Applications Corporation (SRA), Fairfax, VA

  • Venue:
  • COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a Spanish Part-of-Speech (POS) tagger which applies and extends Brill's algorithm for unsupervised learning of rule-based taggers (Brill, 1995). First, we discuss our general approach including extensions we made to the algorithm in order to handle unknown words and parameterize learning and tagging options. Next, we report and analyze our experimental results using different parameters. Then, we describe our "hybrid" approach which was necessary in order to overcome a fundamental limitation in Brill's original algorithm. Finally, we compare our tagger with Hidden Markov Model (HMM)-based taggers.