Machine Learning Based English-to-Korean Transliteration Using Grapheme and Phoneme Information

  • Authors:
  • Jong-Hoon Oh;Key-Sun Choi

  • Affiliations:
  • The authors are with the Dept. of Computer Science, KAIST, 373--1 Guseong-Dong, Yuseong-Gu, Daejeon 305--701 Republic of Korea. E-mail: rovellia@world.kaist.ac.kr, E-mail: kschoi@world.kaist.ac.kr;The authors are with the Dept. of Computer Science, KAIST, 373--1 Guseong-Dong, Yuseong-Gu, Daejeon 305--701 Republic of Korea. E-mail: rovellia@world.kaist.ac.kr, E-mail: kschoi@world.kaist.ac.kr

  • Venue:
  • IEICE - Transactions on Information and Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Machine transliteration is an automatic method to generate characters or words in one alphabetical system for the corresponding characters in another alphabetical system. Machine transliteration can play an important role in natural language application such as information retrieval and machine translation, especially for handling proper nouns and technical terms. The previous works focus on either a grapheme-based or phoneme-based method. However, transliteration is an orthographical and phonetic converting process. Therefore, both grapheme and phoneme information should be considered in machine transliteration. In this paper, we propose a grapheme and phoneme-based transliteration model and compare it with previous grapheme-based and phoneme-based models using several machine learning techniques. Our method shows about 13∼78% performance improvement.