Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Tiered Tagging and Combined Language Models Classifiers
TSD '99 Proceedings of the Second International Workshop on Text, Speech and Dialogue
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Transformation-based learning in the fast lane
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
MorphSlav '03 Proceedings of the 2003 EACL Workshop on Morphological Processing of Slavic Languages
Towards the adequate evaluation of morphosyntactic taggers
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Effective architecture of the polish tagger
TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Towards the lemmatisation of polish nominal syntactic groups using a shallow grammar
SIIS'11 Proceedings of the 2011 international conference on Security and Intelligent Information Systems
Polish language processing chains for multilingual information systems
NLDB'12 Proceedings of the 17th international conference on Applications of Natural Language Processing and Information Systems
ISMIS'12 Proceedings of the 20th international conference on Foundations of Intelligent Systems
Induction of dependency structures based on weighted projection
ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part I
Coreference annotation schema for an inflectional language
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Using part of speech n-grams for improving automatic speech recognition of polish
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Hi-index | 0.00 |
In this paper we present and evaluate a Brill morphosyntactic transformation-based tagger adapted for specifics of highly inflectional languages. Multi-phase tagging with grammatical category matching transformations and lexical transformations brings significant accuracy improvements comparing to previous work. Evaluation shows the accuracy of 92.44% for the Polish language which is higher than the same metric for the other known taggers of Polish: stochastic trigram tagger (90.59%) and hybrid tagger TaKIPI employing decision tree classifier and automatically extracted rule-based tagger used for tagging the IPI PAN Corpus of Polish (91.06%).