Syntax-based statistical machine translation using tree automata and tree transducers

  • Authors:
  • Daniel Emilio Beck

  • Affiliations:
  • Federal University of São Carlos

  • Venue:
  • HLT-SS '11 Proceedings of the ACL 2011 Student Session
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper I present a Master's thesis proposal in syntax-based Statistical Machine Translation. I propose to build discriminative SMT models using both tree-to-string and tree-to-tree approaches. Translation and language models will be represented mainly through the use of Tree Automata and Tree Transducers. These formalisms have important representational properties that makes them well-suited for syntax modeling. I also present an experiment plan to evaluate these models through the use of a parallel corpus written in English and Brazilian Portuguese.