The Penn Chinese TreeBank: Phrase structure annotation of a large corpus
Natural Language Engineering
PCFG parsing for restricted classical Chinese texts
SIGHAN '02 Proceedings of the first SIGHAN workshop on Chinese language processing - Volume 18
Generating Chinese couplets using a statistical MT approach
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Discriminative reordering with Chinese grammatical relations features
SSST '09 Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation
A classical Chinese corpus with nested part-of-speech tags
LaTeCH '12 Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Hi-index | 0.00 |
As interest grows in the use of linguistically annotated corpora in research and teaching of foreign languages and literature, treebanks of various historical texts have been developed. We introduce the first large-scale dependency treebank for Classical Chinese literature. Derived from the Stanford dependency types, it consists of over 32K characters drawn from a collection of poems written in the 8th century CE. We report on the design of new dependency relations, discuss aspects of the annotation process and evaluation, and illustrate its use in a study of parallelism in Classical Chinese poetry.