Modeling lexical cohesion for document-level machine translation

  • Authors:
  • Deyi Xiong;Guosheng Ben;Min Zhang;Yajuan Lü;Qun Liu

  • Affiliations:
  • School of Computer Science and Technology, Soochow University, Suzhou, China and Institute for Infocomm Research, Connexis, Singapore;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, China;School of Computer Science and Technology, Soochow University, Suzhou, China and Institute for Infocomm Research, Connexis, Singapore;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, China and Centre for Next Generation Localisation, School of Computing, Dublin City University, Ireland

  • Venue:
  • IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Lexical cohesion arises from a chain of lexical items that establish links between sentences in a text. In this paper we propose three different models to capture lexical cohesion for document-level machine translation: (a) a direct reward model where translation hypotheses are rewarded whenever lexical cohesion devices occur in them, (b) a conditional probability model where the appropriateness of using lexical cohesion devices is measured, and (c) a mutual information trigger model where a lexical cohesion relation is considered as a trigger pair and the strength of the association between the trigger and the triggered item is estimated by mutual information. We integrate the three models into hierarchical phrase-based machine translation and evaluate their effectiveness on the NIST Chinese-English translation tasks with large-scale training data. Experiment results show that all three models can achieve substantial improvements over the baseline and that the mutual information trigger model performs better than the others.