Improving transliteration accuracy using word-origin detection and lexicon lookup

  • Authors: Mitesh M. Khapra; Pushpak Bhattacharyya
  • Affiliations: IIT Bombay; IIT Bombay
  • Venue: NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
  • Year: 2009

Abstract

We propose a transliteration framework with three components: (i) a word-origin detection engine (pre-processing), (ii) a CRF-based transliteration engine, and (iii) a re-ranking model based on lexicon lookup (post-processing). Results for English-Hindi and English-Kannada transliteration show that the pre-processing and post-processing modules improve top-1 accuracy by 7.1%.
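
The three-stage structure described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names, the origin labels, the additive lexicon bonus, and the placeholder candidate strings are hypothetical, and the CRF engine is stood in for by a plain callable. The abstract does not specify how the authors' re-ranking model scores candidates, so this is not their method.

```python
from typing import Callable, Dict, List, Set, Tuple

# A candidate is (target-language string, score from the transliteration engine).
Candidate = Tuple[str, float]
Engine = Callable[[str], List[Candidate]]


def rerank_with_lexicon(candidates: List[Candidate],
                        lexicon: Set[str],
                        bonus: float = 1.0) -> List[Candidate]:
    """Post-processing: add a fixed bonus (an assumed scoring rule) to every
    candidate found in the target-language lexicon, then sort by score."""
    rescored = [(text, score + (bonus if text in lexicon else 0.0))
                for text, score in candidates]
    return sorted(rescored, key=lambda c: c[1], reverse=True)


def transliterate(word: str,
                  detect_origin: Callable[[str], str],
                  engines: Dict[str, Engine],
                  lexicon: Set[str],
                  top_k: int = 5) -> List[Candidate]:
    """Run the three stages in order: origin detection, candidate
    generation by the origin-specific engine, lexicon-based re-ranking."""
    origin = detect_origin(word)            # (i) pre-processing
    candidates = engines[origin](word)      # (ii) stand-in for the CRF engine
    return rerank_with_lexicon(candidates, lexicon)[:top_k]  # (iii)


# Toy stand-ins; a real system would plug in trained models here.
def toy_detect_origin(word: str) -> str:
    return "indic" if word.endswith("a") else "western"


toy_engines: Dict[str, Engine] = {
    "indic": lambda w: [("target_A", 0.6), ("target_B", 0.4)],
    "western": lambda w: [("target_C", 0.7), ("target_D", 0.3)],
}
toy_lexicon = {"target_B"}  # pretend only this form is an attested word

ranked = transliterate("rama", toy_detect_origin, toy_engines, toy_lexicon)
print(ranked[0][0])
```

In this toy run the engine prefers `target_A`, but the lexicon bonus moves `target_B` to the top, which is the kind of correction a lexicon-lookup re-ranker is meant to make.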