Grammatical category disambiguation by statistical optimization
Computational Linguistics
Neural Networks - 2004 Special issue: New developments in self-organizing systems
The role of lexical resources in CJK natural language processing
MLRI '06 Proceedings of the Workshop on Multilingual Language Resources and Interoperability
Acquiring translational equivalence from a japanese-chinese parallel corpus
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
The contribution of lexical resources to natural language processing of CJK languages
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Hi-index | 0.00 |
This paper poses a new method for Chinese language corpus processing. Unlike the past researches, our approach has following charactericstics: it blends segmenation with tagging and integrates rule-based approach with statistics-based one in grammatical disambiguation. The principal ideas presented in the paper are incorporated in the development of a Chinese corpus processing system. Experimental results prove that the overall accuracy for segmentation is 97.68% and that for tagging is 94.55% in about 400,000 Chinese characters.