This paper describes part of one of the most important syntactic subsystems of many inflectional languages, grammatical agreement, from the viewpoint of automatic morphological disambiguation of such languages. The main ideas are demonstrated on Czech, which, owing to its morphological and syntactic complexity, can be regarded as representative of the inflectional subgroup of the Slavic language family. It is shown that, notwithstanding the intricacies of Czech syntax, a deeper understanding of the nature of grammatical agreement can lead to surface-syntax rules that contribute considerably to the automatic morphological disambiguation of texts stored in Czech corpora. Although Czech is the only language studied, the ideas presented seem to be applicable, mutatis mutandis, to the morphological disambiguation of languages of a similar type, especially the Slavic ones.
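To make the idea of an agreement-based surface rule concrete, the following minimal Python sketch prunes the morphological readings of an adjective and the noun it modifies so that only readings agreeing in gender, number and case survive. It is an illustration under stated assumptions, not the rule formalism used in the paper: the `Reading` type, the function names, and the simplified, incomplete reading inventories for the Czech phrase "mladé ženy" are all hypothetical, and the sketch assumes it is already known that the adjective modifies the noun.

```python
from typing import List, NamedTuple, Tuple


class Reading(NamedTuple):
    """One morphological reading of a word form (hypothetical, simplified tag)."""
    pos: str     # "ADJ" or "NOUN"
    gender: str  # e.g. "F", "N"
    number: str  # "SG" or "PL"
    case: str    # e.g. "NOM", "GEN", "DAT", "ACC"


def agree(a: Reading, b: Reading) -> bool:
    """True if two readings share gender, number and case."""
    return (a.gender, a.number, a.case) == (b.gender, b.number, b.case)


def prune_by_agreement(
    adj: List[Reading], noun: List[Reading]
) -> Tuple[List[Reading], List[Reading]]:
    """Keep only readings that agree with at least one reading of the other word.

    Assumes the adjective modifies the noun. If no pair of readings agrees,
    the rule does not apply and both lists are returned unchanged.
    """
    kept_adj = [a for a in adj if any(agree(a, n) for n in noun)]
    kept_noun = [n for n in noun if any(agree(a, n) for a in adj)]
    if not kept_adj or not kept_noun:
        return list(adj), list(noun)
    return kept_adj, kept_noun


# Simplified, incomplete reading inventories (an assumption for illustration).
mlade = [
    Reading("ADJ", "F", "SG", "GEN"),
    Reading("ADJ", "F", "SG", "DAT"),
    Reading("ADJ", "N", "SG", "NOM"),
    Reading("ADJ", "F", "PL", "NOM"),
    Reading("ADJ", "F", "PL", "ACC"),
]
zeny = [
    Reading("NOUN", "F", "SG", "GEN"),
    Reading("NOUN", "F", "PL", "NOM"),
    Reading("NOUN", "F", "PL", "ACC"),
]

kept_adj, kept_noun = prune_by_agreement(mlade, zeny)
```

In this toy case the rule discards the feminine dative singular and neuter nominative singular readings of the adjective, but three agreeing pairs remain, so agreement alone narrows the ambiguity without fully resolving it; further context would be needed to pick a single reading.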