Hebrew dependency parsing: initial results

  • Authors:
  • Yoav Goldberg;Michael Elhadad

  • Affiliations:
  • Ben Gurion University of the Negev, Be'er Sheva, Israel;Ben Gurion University of the Negev, Be'er Sheva, Israel

  • Venue:
  • IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe a newly available Hebrew Dependency Treebank, which is extracted from the Hebrew (constituency) Tree-bank. We establish some baseline unlabeled dependency parsing performance on Hebrew, based on two state-of-the-art parsers, MST-parser and MaltParser. The evaluation is performed both in an artificial setting, in which the data is assumed to be properly morphologically segmented and POS-tagged, and in a real-world setting, in which the parsing is performed on automatically segmented and POS-tagged text. We present an evaluation measure that takes into account the possibility of incompatible token segmentation between the gold standard and the parsed data. Results indicate that (a) MST-parser performs better on Hebrew data than Malt-Parser, and (b) both parsers do not make good use of morphological information when parsing Hebrew.