A hybrid model for extracting transliteration equivalents from parallel corpora

Authors:
Jong-Hoon Oh; Key-Sun Choi; Hitoshi Isahara
Affiliations:
Computational Linguistics Group, NICT, Kyoto, Japan; Computer Science Division, EECS, KAIST, Daejeon, Republic of Korea; Computational Linguistics Group, NICT, Kyoto, Japan
Venue:
TSD'06: Proceedings of the 9th International Conference on Text, Speech and Dialogue
Year:
2006

Abstract

Several models for transliteration pair acquisition have been proposed to overcome the out-of-vocabulary problem caused by transliterations. To date, however, little work has addressed a framework that can accommodate several of these models at the same time. Moreover, little attention has been paid to validating acquired transliteration pairs against up-to-date corpora, such as web documents. To address these problems, we propose a hybrid model for transliteration pair acquisition. In this paper, we concentrate on a framework for combining several transliteration pair acquisition models. Experiments showed that our hybrid model was more effective than any individual transliteration pair acquisition model alone.
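
The abstract does not say how the individual models are combined, so the following is only a minimal sketch of one common way to build such a framework: a weighted average of per-model scores followed by a threshold. Everything in it is an assumption made for illustration and not the authors' method: the two toy scorers, the names `hybrid_score` and `extract_pairs`, the equal weights, the 0.5 threshold, and the romanized example strings.

```python
from typing import Callable, List, Tuple

# A scorer maps a (source, target) candidate pair to a score in [0, 1].
Scorer = Callable[[str, str], float]


def char_overlap_score(source: str, target: str) -> float:
    """Toy stand-in model: Jaccard overlap of the two character sets."""
    a, b = set(source.lower()), set(target.lower())
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def length_ratio_score(source: str, target: str) -> float:
    """Toy stand-in model: ratio of the shorter length to the longer one."""
    if not source or not target:
        return 0.0
    return min(len(source), len(target)) / max(len(source), len(target))


def hybrid_score(source: str, target: str,
                 scorers: List[Tuple[Scorer, float]]) -> float:
    """Combine several models as a weighted average (assumed scheme)."""
    total_weight = sum(weight for _, weight in scorers)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted = sum(weight * scorer(source, target) for scorer, weight in scorers)
    return weighted / total_weight


def extract_pairs(candidates: List[Tuple[str, str]],
                  scorers: List[Tuple[Scorer, float]],
                  threshold: float) -> List[Tuple[str, str]]:
    """Keep the candidate pairs whose combined score reaches the threshold."""
    return [(s, t) for s, t in candidates
            if hybrid_score(s, t, scorers) >= threshold]


# Hypothetical usage with romanized strings and equal weights.
scorers = [(char_overlap_score, 1.0), (length_ratio_score, 1.0)]
candidates = [("data", "deita"), ("data", "keompyuteo")]
accepted = extract_pairs(candidates, scorers, threshold=0.5)
```

Keeping every model behind the same `(source, target) -> score` signature means a further acquisition model, for example one that validates a pair against web documents, could be added to the list without changing the combination code.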