Opening Statistical Translation Engines to Terminological Resources

  • Authors: Philippe Langlais
  • Venue: NLDB '02 Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers
  • Year: 2002

Abstract

The past decade has witnessed exciting work in the field of Statistical Machine Translation (SMT). However, accurately evaluating its potential in real-life contexts remains an open question. In this study, we investigate the behavior of an SMT engine faced with a corpus far different from the one it was trained on. We show that terminological databases are obvious resources that should be used to boost the performance of a statistical engine. We propose and evaluate a way of integrating terminology into an SMT engine which yields a significant reduction in word error rate.