Human judgements in parallel treebank alignment

  • Authors:
  • Martin Volk; Torsten Marek; Yvonne Samuelsson

  • Affiliations:
  • University of Zurich, Zurich, Switzerland; University of Zurich, Zurich, Switzerland; Stockholm University, Stockholm, Sweden

  • Venue:
  • HumanJudge '08 Proceedings of the Workshop on Human Judgements in Computational Linguistics
  • Year:
  • 2008

Abstract

We have built a parallel treebank that includes word and phrase alignment. The alignment information was manually checked using a graphical tool that allows the annotator to view a pair of trees from parallel sentences. We found the compilation of clear alignment guidelines to be a difficult task. However, experiments with a group of students have shown that we are on the right track, with up to 89% overlap between the student annotation and our own. At the same time, these experiments have helped us to pinpoint the weaknesses in the guidelines, many of which concerned unclear rules related to differences in grammatical forms between the languages.
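
The abstract does not say how the overlap figure was computed. As a rough illustration only, the sketch below treats each annotation as a set of (source node, target node) links and reports the share of reference links that the student also drew. The function name `alignment_overlap`, the node identifiers, and this recall-style definition are all assumptions made for the example, not the authors' method.

```python
from typing import Set, Tuple

# One alignment link: (source-tree node id, target-tree node id).
# The id format is hypothetical.
Link = Tuple[str, str]


def alignment_overlap(student: Set[Link], reference: Set[Link]) -> float:
    """Share of reference links that also appear in the student annotation.

    This is an assumed definition of "overlap"; the abstract does not
    specify the measure that was actually used.
    """
    if not reference:
        raise ValueError("reference annotation contains no links")
    return len(student & reference) / len(reference)


# Toy data: two of the four reference links were also drawn by the student.
reference = {("s1", "t1"), ("s2", "t2"), ("s3", "t3"), ("s4", "t5")}
student = {("s1", "t1"), ("s2", "t2"), ("s3", "t4")}

print(alignment_overlap(student, reference))  # 0.5
```

Dividing by the size of the student set instead would give a precision-style figure. Which of the two (or a combination) corresponds to the reported 89% cannot be determined from the abstract alone.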