Some improvements in phrase-based statistical machine translation

  • Authors:
  • Zhendong Yang;Wei Pang;Jinhua Du;Wei Wei;Bo Xu

  • Affiliations:
  • Hi-tech Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing;Hi-tech Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing;Hi-tech Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing;Hi-tech Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing;Hi-tech Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing

  • Venue:
  • ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In statistical machine translation, many of the top-performing systems are phrase-based systems. This paper describes a phrase-based translation system and some improvements. We use more information to compute translation probability. The scaling factors of the log-linear models are estimated by the minimum error rate training that uses an evaluation criteria to balance BLEU and NIST scores. We extract phrase-template from initial phrases to deal with data sparseness and distortion problem through decoding. By re-ranking the n-best list of translations generated firstly, the system gets the final output. Some experiments concerned show that all these refinements are beneficial to get better results.