Hierarchical phrase-based translation representations

  • Authors:
  • Gonzalo Iglesias;Cyril Allauzen;William Byrne;Adrià de Gispert;Michael Riley

  • Affiliations:
  • University of Cambridge, Cambridge, U. K.;Google Research, New York, NY;University of Cambridge, Cambridge, U. K.;University of Cambridge, Cambridge, U. K.;Google Research, New York, NY

  • Venue:
  • EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper compares several translation representations for a synchronous context-free grammar parse including CFGs/hypergraphs, finite-state automata (FSA), and pushdown automata (PDA). The representation choice is shown to determine the form and complexity of target LM intersection and shortest-path algorithms that follow. Intersection, shortest path, FSA expansion and RTN replacement algorithms are presented for PDAs. Chinese-to-English translation experiments using HiFST and HiPDT, FSA and PDA-based decoders, are presented using admissible (or exact) search, possible for HiFST with compact SCFG rulesets and HiPDT with compact LMs. For large rulesets with large LMs, we introduce a two-pass search strategy which we then analyze in terms of search errors and translation performance.