Tagging English text with a probabilistic model
Computational Linguistics
A practical part-of-speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Tagging French: comparing a statistical and a constraint-based method
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
An information extraction core system for real world German text processing
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Hi-index | 0.00 |
This paper describes a Spanish Part-of-Speech (POS) tagger which applies and extends Brill's algorithm for unsupervised learning of rule-based taggers (Brill, 1995). First, we discuss our general approach including extensions we made to the algorithm in order to handle unknown words and parameterize learning and tagging options. Next, we report and analyze our experimental results using different parameters. Then, we describe our "hybrid" approach which was necessary in order to overcome a fundamental limitation in Brill's original algorithm. Finally, we compare our tagger with Hidden Markov Model (HMM)-based taggers.