Graphemic approximation of phonological context for English-Chinese transliteration

Authors:
Oi Yee Kwong
Affiliations:
City University of Hong Kong, Kowloon, Hong Kong
Venue:
NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
Year:
2009

Citing 9
Cited 0

A stochastic finite-state word-segmentation algorithm for Chinese

Computational Linguistics
The TREC-5 Confusion Track: Comparing Retrieval Methods for Scanned Text

Information Retrieval
Machine transliteration

Computational Linguistics
Transliteration of proper names in cross-lingual information retrieval

MultiNER '03 Proceedings of the ACL 2003 workshop on Multilingual and mixed-language named entity recognition - Volume 15
A joint source-channel model for machine transliteration

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Unsupervised named entity transliteration using temporal and phonetic correlation

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Homophones and tonal patterns in English-Chinese transliteration

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Extraction of transliteration pairs from parallel corpora using a statistical transliteration model

Information Sciences: an International Journal
An ensemble of grapheme and phoneme for machine transliteration

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Although direct orthographic mapping has been shown to outperform phoneme-based methods in English-to-Chinese (E2C) transliteration, it is observed that phonological context plays an important role in resolving graphemic ambiguity. In this paper, we investigate the use of surface graphemic features to approximate local phonological context for E2C. In the absence of an explicit phonemic representation of the English source names, experiments show that the previous and next character of a given English segment could effectively capture the local context affecting its expected pronunciation, and thus its rendition in Chinese.