Jane: an advanced freely available hierarchical machine translation toolkit

  • Authors:
  • David Vilar;Daniel Stein;Matthias Huck;Hermann Ney

  • Affiliations:
  • RWTH Aachen University, Aachen, Germany 52056 and DFKI GmbH, Berlin, Germany 10559;RWTH Aachen University, Aachen, Germany 52056;RWTH Aachen University, Aachen, Germany 52056;RWTH Aachen University, Aachen, Germany 52056

  • Venue:
  • Machine Translation
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this article we will describe the design and implementation of Jane, an efficient hierarchical phrase-based (HPB) toolkit developed at RWTH Aachen University. The system has been used by RWTH at several international evaluation campaigns, including the WMT and NIST evaluations, and is now freely available for non-commercial application. We will go through the main features of Jane, which include, among others, support for different search strategies, different language model formats, support for syntax-based enhancements to the HPB machine translation paradigm, string-to-dependency translation, extended lexicon models, different methods for minimum-error-rate training and distributed operation on a computer cluster. Special attention has been paid to the efficiency of the decoder, clean code and quality assurance through unit and regression testing. Results on current machine translation tasks are reported, which show that the system is able to obtain state-of-the-art performance.