Improving Arabic dependency parsing with form-based and functional morphological features

  • Authors:
  • Yuval Marton;Nizar Habash;Owen Rambow

  • Affiliations:
  • T.J. Watson Research Center, IBM;Columbia University;Columbia University

  • Venue:
  • HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We explore the contribution of morphological features -- both lexical and inflectional -- to dependency parsing of Arabic, a morphologically rich language. Using controlled experiments, we find that definiteness, person, number, gender, and the undiacritzed lemma are most helpful for parsing on automatically tagged input. We further contrast the contribution of form-based and functional features, and show that functional gender and number (e.g., "broken plurals") and the related rationality feature improve over form-based features. It is the first time functional morphological features are used for Arabic NLP.