Learning finite state transducers using bilingual phrases

  • Authors:
  • Jorge González;Germán Sanchis;Francisco Casacuberta

  • Affiliations:
  • Instituto Tecnológico de Informática, Universidad Politécnica de Valencia;Instituto Tecnológico de Informática, Universidad Politécnica de Valencia;Instituto Tecnológico de Informática, Universidad Politécnica de Valencia

  • Venue:
  • CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

Statistical Machine Translation is receiving more and more attention every day due to the success that the phrase-based alignment models are obtaining. However, despite their power, state-of-the-art systems using these models present a series of disadvantages that lessen their effectiveness in working environments where temporal or spacial computational resources are limited. A finite-state framework represents an interesting alternative because it constitutes an efficient paradigm where quality and realtime factors are properly integrated in order to build translation devices that may be of help for their potential users. Here, we describe a way to use the bilingual information in a phrase-based model in order to implement a phrase-based ngram model using finite state transducers. It will be worth the trouble due to the notable decrease in computational requirements that finite state transducers present in practice with respect to the use of some well-known stack-decoding algorithms. Results for the French-English EuroParl benchmark corpus from the 2006 Workshop on Machine Translation of the ACL are reported.