Three models for discriminative machine translation using global lexical selection and sentence reconstruction

  • Authors:
  • Sriram Venkatapathy;Srinivas Bangalore

  • Affiliations:
  • Language Technologies Research Centre, IIIT-Hyderabad Hyderabad, India;AT&T Labs - Research, Florham Park, NJ

  • Venue:
  • SSST '07 Proceedings of the NAACL-HLT 2007/AMTA Workshop on Syntax and Structure in Statistical Translation
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Machine translation of a source language sentence involves selecting appropriate target language words and ordering the selected words to form a well-formed target language sentence. Most of the previous work on statistical machine translation relies on (local) associations of target words/phrases with source words/phrases for lexical selection. In contrast, in this paper, we present a novel approach to lexical selection where the target words are associated with the entire source sentence (global) without the need for local associations. This technique is used by three models (Bag-of-words model, sequential model and hierarchical model) which predict the target language words given a source sentence and then order the words appropriately. We show that a hierarchical model performs best when compared to the other two models.