COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Tagging inflective languages: prediction of morphological categories for a rich, structured tagset
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
MULTEXT: Multilingual Text Tools and Corpora
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Hi-index | 0.00 |
The paper describes the morphologically annotated corpus formed by the Czech translation of George Orwell's novel Nineteen-Eighty Four. It also presents frequencies of some morphosyntactic features and focuses on syntactic structure of noun-ended prepositional phrases in the corpus with emphasis laid on grammatical concord in these structures. The study of these structures serves for the development of the formal grammar used for morphosyntactic rule-based tagging of Czech texts.