A hybrid model for extracting transliteration equivalents from parallel corpora

  • Authors:
  • Jong-Hoon Oh;Key-Sun Choi;Hitoshi Isahara

  • Affiliations:
  • Computational Linguistics Group, NICT, Kyoto, Japan;Computer Science Division, EECS, KAIST, Daejeon, Republic of Korea;Computational Linguistics Group, NICT, Kyoto, Japan

  • Venue:
  • TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several models for transliteration pair acquisition have been proposed to overcome the out-of-vocabulary problem caused by transliterations To date, however, there has been little literature regarding a framework that can accommodate several models at the same time Moreover, there is little concern for validating acquired transliteration pairs using up-to-date corpora, such as web documents To address these problems, we propose a hybrid model for transliteration pair acquisition In this paper, we concentrate on a framework for combining several models for transliteration pair acquisition Experiments showed that our hybrid model was more effective than each individual transliteration pair acquisition model alone.