Refining lexical translation training scheme for improving the quality of statistical phrase-based translation

  • Authors:
  • Cuong Hoang;Cuong Anh Le;Son Bao Pham

  • Affiliations:
  • University of Engineering and Technology, Vietnam National University, Hanoi;University of Engineering and Technology, Vietnam National University, Hanoi;University of Engineering and Technology, Vietnam National University, Hanoi

  • Venue:
  • Proceedings of the Third Symposium on Information and Communication Technology
  • Year:
  • 2012

Abstract

Under word-based alignment, frequent words with consistent translations can be aligned with high precision. However, words that are less frequent or that exhibit diverse translations in the training corpora generally lack statistically significant evidence for confident alignment [7]. In this work, we propose a bootstrapping algorithm to capture the alignments of such infrequent or diversely translated words. Notably, we make no explicit assumptions about the language pair used. Accordingly, we evaluate the approach on two phrase-based translation systems: English-Vietnamese and English-French. The experiments show a significant "boosting" effect on overall translation quality for both tasks.