Analysis and system combination of phrase- and N-gram-based statistical machine translation systems

  • Authors:
  • Marta R. Costa-jussà;Josep M. Crego;David Vilar;José A. R. Fonollosa;José B. Mariño;Hermann Ney

  • Affiliations:
  • TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;RWTH Aachen University, Aachen, Germany;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;RWTH Aachen University, Aachen, Germany

  • Venue:
  • NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the framework of the Tc-Star project, we analyze and propose a combination of two Statistical Machine Translation systems: a phrase-based and an N-gram-based one. The exhaustive analysis includes a comparison of the translation models in terms of efficiency (number of translation units used in the search and computational time) and an examination of the errors in each system's output. Additionally, we combine both systems, showing accuracy improvements.