English-to-Korean transliteration using multiple unbounded overlapping phoneme chunks

  • Authors:
  • In-Ho Kang;GilChang Kim

  • Affiliations:
  • Korea Advanced Institute of Science and Technology;Korea Advanced Institute of Science and Technology

  • Venue:
  • COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present in this paper the method of English-to-Korean (E-K) transliteration and back-transliteration. In Korean technical documents, many English words are transliterated into Korean words in various forms in diverse ways. As English words and Korean transliterations are usually technical terms and proper nouns, it is hard to find a transliteration and its variations in a dictionary. Therefore an automatic transliteration system is needed to find the transliterations of English words without manual intervention.To explain E-K transliteration phenomena, we use phoneme chunks that do not have a length limit. By applying phoneme chunks, we combine different length information with easy. The E-K transliteration method has three steps. In the first, we make a phoneme network that shows all possible transliterations of the given word. In the second step, we apply phoneme chunks, extracted from training data, to calculate the reliability of each possible transliteration. Then we obtain probable transliterations of the given English word.