A topic similarity model for hierarchical phrase-based translation

  • Authors:
  • Xinyan Xiao;Deyi Xiong;Min Zhang;Qun Liu;Shouxun Lin

  • Affiliations:
  • Key Lab. of Intelligent Info. Processing, Institute of Computing Technology, Chinese Academy of Sciences;Human Language Technology, Institute for Infocomm Research;Human Language Technology, Institute for Infocomm Research;Key Lab. of Intelligent Info. Processing, Institute of Computing Technology, Chinese Academy of Sciences;Key Lab. of Intelligent Info. Processing, Institute of Computing Technology, Chinese Academy of Sciences

  • Venue:
  • ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
  • Year:
  • 2012

Abstract

Previous work using topic models for statistical machine translation (SMT) explores topic information at the word level. However, SMT has advanced from the word-based paradigm to the phrase/rule-based paradigm. We therefore propose a topic similarity model that exploits topic information at the synchronous rule level for hierarchical phrase-based translation. We associate each synchronous rule with a topic distribution, and select desirable rules according to the similarity of their topic distributions to those of the given documents. We show that our model significantly improves translation performance over the baseline in NIST Chinese-to-English translation experiments. Our model also achieves better performance and faster speed than previous approaches that work at the word level.
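The rule-selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure is used, so the sketch assumes one based on the Hellinger distance between two discrete topic distributions. The function names, rule labels, and topic values below are all hypothetical and serve only as an illustration, not as the authors' implementation.

```python
import math


def hellinger_similarity(p, q):
    """Return 1 minus the Hellinger distance between two topic distributions.

    Assumption: p and q are sequences of probabilities over the same topics.
    The result is 1.0 for identical distributions and 0.0 for disjoint ones.
    """
    if len(p) != len(q):
        raise ValueError("distributions must cover the same number of topics")
    squared = sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q))
    return 1.0 - math.sqrt(squared / 2.0)


def rank_rules(rule_topics, doc_topics):
    """Order rule identifiers from most to least similar to the document."""
    return sorted(
        rule_topics,
        key=lambda rule: hellinger_similarity(rule_topics[rule], doc_topics),
        reverse=True,
    )


# Hypothetical data: three topics, one document, two candidate rules.
doc_topics = [0.8, 0.1, 0.1]
rule_topics = {
    "rule_a": [0.7, 0.2, 0.1],  # topic profile close to the document
    "rule_b": [0.1, 0.1, 0.8],  # topic profile far from the document
}

ranking = rank_rules(rule_topics, doc_topics)
```

In a decoder, a score of this kind would typically enter as one feature among many rather than as a hard filter; that integration is outside the scope of this sketch.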