English-to-Korean transliteration using multiple unbounded overlapping phoneme chunks

Authors:
In-Ho Kang;GilChang Kim
Affiliations:
Korea Advanced Institute of Science and Technology;Korea Advanced Institute of Science and Technology
Venue:
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Year:
2000

Citing 3
Cited 15

A maximum entropy approach to natural language processing

Computational Linguistics
Approximate String Matching

ACM Computing Surveys (CSUR)
Machine transliteration

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics

Automatic generation of Japanese–English bilingual thesauri based on bilingual corpora

Journal of the American Society for Information Science and Technology - Research Articles
An ensemble of transliteration models for information retrieval

Information Processing and Management: an International Journal
A machine transliteration model based on correspondence between graphemes and phonemes

ACM Transactions on Asian Language Information Processing (TALIP)
Concept unification of terms in different languages for IR

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Punjabi machine transliteration

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A phonetic similarity model for automatic extraction of transliteration pairs

ACM Transactions on Asian Language Information Processing (TALIP)
A Hybrid Technique for English-Chinese Cross Language Information Retrieval

ACM Transactions on Asian Language Information Processing (TALIP)
A comparison of different machine transliteration models

Journal of Artificial Intelligence Research
Web-Based Transliteration of Person Names

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Machine transliteration survey

ACM Computing Surveys (CSUR)
Non-productive machine transliteration

RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Improving machine transliteration performance by using multiple transliteration models

ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
An ensemble of grapheme and phoneme for machine transliteration

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Improving transliteration with precise alignment of phoneme chunks and using contextual features

AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
Translation techniques in cross-language information retrieval

ACM Computing Surveys (CSUR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present in this paper the method of English-to-Korean (E-K) transliteration and back-transliteration. In Korean technical documents, many English words are transliterated into Korean words in various forms in diverse ways. As English words and Korean transliterations are usually technical terms and proper nouns, it is hard to find a transliteration and its variations in a dictionary. Therefore an automatic transliteration system is needed to find the transliterations of English words without manual intervention.To explain E-K transliteration phenomena, we use phoneme chunks that do not have a length limit. By applying phoneme chunks, we combine different length information with easy. The E-K transliteration method has three steps. In the first, we make a phoneme network that shows all possible transliterations of the given word. In the second step, we apply phoneme chunks, extracted from training data, to calculate the reliability of each possible transliteration. Then we obtain probable transliterations of the given English word.