Using word support model to improve Chinese input system

Authors:
Jia-Lin Tsai
Affiliations:
Tung Nan Institute of Technology, Taipei, Taiwan
Venue:
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Year:
2006

Citing 6
Cited 0

Six-digit coding method

Communications of the ACM
Foundations of statistical natural language processing

Foundations of statistical natural language processing
On phoneme—to—character conversion systems in Chinese processing

Journal of the Chinese Institute of Engineers - Chinese speech and language processing
Toward a unified approach to statistical language modeling for Chinese

ACM Transactions on Asian Language Information Processing (TALIP)
Task adaptation in stochastic language model for Chinese homophone disambiguation

ACM Transactions on Asian Language Information Processing (TALIP)
Applying an NVEF word-pair identifier to the Chinese syllable-to-word conversion problem

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper presents a word support model (WSM). The WSM can effectively perform homophone selection and syllable-word segmentation to improve Chinese input systems. The experimental results show that: (1) the WSM is able to achieve tonal (syllables input with four tones) and toneless (syllables input without four tones) syllable-to-word (STW) accuracies of 99% and 92%, respectively, among the converted words; and (2) while applying the WSM as an adaptation processing, together with the Microsoft Input Method Editor 2003 (MSIME) and an optimized bigram model, the average tonal and toneless STW improvements are 37% and 35%, respectively.