The TALP-UPC Ngram-based statistical machine translation system for ACL-WMT 2008

  • Authors:
  • Maxim Khalilov;Adolfo Hernández H.;Marta R. Costa-jussà;Josep M. Crego;Carlos A. Henríquez Q.;Patrik Lambert;José A. R. Fonollosa;José B. Mariño;Rafael E. Banchs

  • Affiliations:
  • TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain;TALP Research Center (UPC), Barcelona, Spain

  • Venue:
  • StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper reports on the participation of the TALP Research Center of the UPC (Universitat Politècnica de Catalunya) to the ACL WMT 2008 evaluation campaign. This year's system is the evolution of the one we employed for the 2007 campaign. Main updates and extensions involve linguistically motivated word reordering based on the reordering patterns technique. In addition, this system introduces a target language model, based on linguistic classes (Part-of-Speech), morphology reduction for an inflectional language (Spanish) and an improved optimization procedure. Results obtained over the development and test sets on Spanish to English (and the other way round) translations for both the traditional Europarl and a challenging News stories tasks are analyzed and commented.