HHMM-based Chinese lexical analyzer ICTCLAS
SIGHAN '03 Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17
WSEAS Transactions on Computers
Hi-index | 0.00 |
Quantifiers and numerals often cause mistakes in Chinese-Japanese machine translation. In this paper, an approach to quantifier translation is proposed based on the syntactic features after classification. First, morphological analysis is performed on sentences extracted from a Chinese-Japanese aligned corpus, which consists of quantifiers and numerals. Next, statistical information is obtained based on the meaning of the nouns with an accompanying quantifier. Using the difference in type and position of quantifiers between Chinese and Japanese, quantifier translation rules were acquired. Evaluation was conducted by using the acquired translation rules. Finally, the adaptability of the experimental data was verified and the methods achieved an accuracy of 90.75%, showing that they were effective in processing quantifiers and numerals translation.