NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation

  • Authors:
  • Tong Xiao; Jingbo Zhu; Hao Zhang; Qiang Li

  • Affiliations:
  • Northeastern University and Key Laboratory of Medical Image Computing, Ministry of Education; Northeastern University and Key Laboratory of Medical Image Computing, Ministry of Education; Northeastern University; Northeastern University

  • Venue:
  • ACL '12 Proceedings of the ACL 2012 System Demonstrations
  • Year:
  • 2012

Abstract

We present a new open source toolkit for phrase-based and syntax-based machine translation. The toolkit supports several state-of-the-art models developed in statistical machine translation, including the phrase-based model, the hierarchical phrase-based model, and various syntax-based models. The key innovation provided by the toolkit is that the decoder can work with various grammars and offers different choices of decoding algorithms, such as phrase-based decoding, decoding as parsing/tree-parsing, and forest-based decoding. Moreover, several useful utilities are distributed with the toolkit, including a discriminative reordering model, a simple and fast language model, and an implementation of minimum error rate training for weight tuning.