Two methods to incorporate local morphosyntactic features in Hindi dependency parsing

  • Authors:
  • Bharat Ram Ambati;Samar Husain;Sambhav Jain;Dipti Misra Sharma;Rajeev Sangal

  • Affiliations:
  • IIIT-Hyderabad, India;IIIT-Hyderabad, India;IIIT-Hyderabad, India;IIIT-Hyderabad, India;IIIT-Hyderabad, India

  • Venue:
  • SPMRL '10 Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we explore two strategies to incorporate local morphosyntactic features in Hindi dependency parsing. These features are obtained using a shallow parser. We first explore which information provided by the shallow parser is most beneficial and show that local morphosyntactic features in the form of chunk type, head/non-head information, chunk boundary information, distance to the end of the chunk and suffix concatenation are very crucial in Hindi dependency parsing. We then investigate the best way to incorporate this information during dependency parsing. Further, we compare the results of various experiments based on various criterions and do some error analysis. All the experiments were done with two data-driven parsers, MaltParser and MSTParser, on a part of multi-layered and multi-representational Hindi Treebank which is under development. This paper is also the first attempt at complete sentence level parsing for Hindi.