Hungarian-English machine translation using GenPar

  • Authors:
  • András Hócza; András Kocsor

  • Affiliations:
  • Department of Informatics, University of Szeged, Szeged, Hungary; Department of Informatics, University of Szeged, Szeged, Hungary

  • Venue:
  • TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
  • Year:
  • 2006

Abstract

We present an approach to machine translation that applies the GenPar toolkit to POS-tagged and syntactically parsed texts. Our experiment in Hungarian-English machine translation is an attempt to develop prototypes of a syntax-driven machine translation system and to examine the effects of various preprocessing steps (POS-tagging, lemmatization and syntactic parsing) on system performance. The annotated monolingual texts needed for the different language-specific tasks were taken from the Szeged Treebank and the Penn Treebank. The parallel sentences were collected from the Hunglish Corpus. Each developed prototype runs fully automatically, and new Hungarian-related functions are built in. The results are evaluated with the BLEU score.