This paper presents a word support model (WSM) that effectively performs homophone selection and syllable-word segmentation to improve Chinese input systems. The experimental results show that (1) the WSM achieves syllable-to-word (STW) accuracies of 99% for tonal input (syllables entered with their four tones) and 92% for toneless input (syllables entered without tones), measured among the converted words; and (2) when the WSM is applied as an adaptation step together with the Microsoft Input Method Editor 2003 (MSIME) and an optimized bigram model, the average tonal and toneless STW improvements are 37% and 35%, respectively.