Dependency syntax analysis using grammar induction and a lexical categories precedence system
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Developing a competitive HMM arabic POS tagger using small training corpora
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Hi-index | 0.00 |
Part of Speech (POS) tagger is a necessary module inmany natural language text processing tasks. A POS taggeris a program that accepts an unprepared raw text ininput and to each word adds a tag specifying its grammaticalproperties, such as part of speech, number, person,etc. One of popular POS taggers-TnT tagger-hasbeen extensively tested for English and some other languages.This paper reports on it evaluation for Spanishlanguage. Error analysis is reported, explaining howsome specific features of Spanish language affect taggerperformance. It is reported that on Spanish texts TnTshows overall tagging accuracy between 92.95% and95.84%, specifically, between 95.47% and 98.56% onknown words and between 75.57% and 83.49% on unknownwords. Results show that TnT has reached a goodlevel of maturity and is helpful enough for NLP tasks.