Formemes in English-Czech deep syntactic MT

  • Authors:
  • Ondřej Dušek; Zdeněk Žabokrtský; Martin Popel; Martin Majliš; Michal Novák; David Mareček

  • Affiliations:
  • Charles University in Prague, Prague (all authors)

  • Venue:
  • WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
  • Year:
  • 2012

Abstract

One of the most notable recent improvements to the TectoMT English-to-Czech translation system is a systematic and theoretically supported revision of formemes, the annotation of morpho-syntactic features of content words in deep dependency syntactic structures based on the Prague tectogrammatics theory. Our modifications aim at reducing data sparsity, increasing consistency across languages, and broadening the range of applications of this markup. Formemes can be used not only in MT, but also in various other NLP tasks.