Computational Linguistics - Special issue on using large corpora: I
Aligning sentences in parallel corpora
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
A program for aligning sentences in bilingual corpora
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Aligning a parallel English-Chinese corpus statistically with lexical criteria
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Learning translation templates from bilingual text
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Chinese-Japanese clause alignment
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Hi-index | 0.00 |
Bi-text alignment is useful to many Natural Language Processing tasks such as machine translation, bilingual lexicography and word sense disambiguation. Most of previous researches are on different language pairs. This paper presents a diachronic alignment of Ancient and Modern Chinese. Because of the long history of Chinese culture and Chinese writing, lots of Ancient Chinese texts are waiting to be translated into modern Chinese, especially, the comparative study of Ancient and Modern Chinese is a very important way to understand some characteristics in Modern Chinese. After describing some characteristics in Ancient-Modern Chinese bi-texts, we first investigate some statistical properties of Ancient-Modern bi-text corpus, including the correlation test of text lengths between two languages and the distribution test of length ratio data. We then pay more attention to n-m(n1 or m1) alignment modes which are prone to mismatch.