An automatic approach to treebank error detection using a dependency parser

  • Authors:
  • Bhasha Agrawal;Rahul Agarwal;Samar Husain;Dipti M. Sharma

  • Affiliations:
  • IIIT-Hyderabad, India;IIIT-Hyderabad, India;University of Potsdam, Germany;IIIT-Hyderabad, India

  • Venue:
  • CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Treebanks play an important role in the development of various natural language processing tools. Amongst other things, they provide crucial language-specific patterns that are exploited by various machine learning techniques. Quality control in any treebanking project is therefore extremely important. Manual validation of the treebank is one of the steps that is generally necessary to ensure good annotation quality. Needless to say, manual validation requires a lot of human time and effort. In this paper, we present an automatic approach which helps in detecting potential errors in a treebank. We use a dependency parser to detect such errors. By using this tool, validators can validate a treebank in less time and with reduced human effort.