Explicit length modelling for statistical machine translation

  • Authors:
  • Joan Albert Silvestre-Cerdí;JesúS AndréS-Ferrer;Jorge Civera

  • Affiliations:
  • Universitat Politècnica de València, Departament de Sistemes Informítics i Computació, Camí de Vera s/n, 46022 València, Spain;Universitat Politècnica de València, Departament de Sistemes Informítics i Computació, Camí de Vera s/n, 46022 València, Spain;Universitat Politècnica de València, Departament de Sistemes Informítics i Computació, Camí de Vera s/n, 46022 València, Spain

  • Venue:
  • Pattern Recognition
  • Year:
  • 2012

Quantified Score

Hi-index 0.01

Visualization

Abstract

Explicit length modelling has been previously explored in statistical pattern recognition with successful results. In this paper, two length models along with two parameter estimation methods and two alternative parametrisations for statistical machine translation (SMT) are presented. More precisely, we incorporate explicit bilingual length modelling in a state-of-the-art log-linear SMT system as an additional feature function in order to prove the contribution of length information. Finally, a systematic evaluation on reference SMT tasks considering different language pairs proves the benefits of explicit length modelling.