Machine transliteration: leveraging on third languages

  • Authors:
  • Min Zhang;Xiangyu Duan;Vladimir Pervouchine;Haizhou Li

  • Affiliations:
  • Institute for Infocomm Research, A-STAR;Institute for Infocomm Research, A-STAR;Institute for Infocomm Research, A-STAR;Institute for Infocomm Research, A-STAR

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents two pivot strategies for statistical machine transliteration, namely system-based pivot strategy and model-based pivot strategy. Given two independent source-pivot and pivot-target name pair corpora, the model-based strategy learns a direct source-target transliteration model while the system-based strategy learns a source-pivot model and a pivot-target model, respectively. Experimental results on benchmark data show that the system-based pivot strategy is effective in reducing the high resource requirement of training corpus for low-density language pairs while the model-based pivot strategy performs worse than the system-based one.