Parsing formal languages using natural language parsing techniques

  • Authors:
  • Jens Nilsson;Welf Löwe;Johan Hall;Joakim Nivre

  • Affiliations:
  • Växjö University, Sweden;Växjö University, Sweden;Växjö University, Sweden;Uppsala University, Sweden and Växjö University, Sweden

  • Venue:
  • IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Program analysis tools used in software maintenance must be robust and ought to be accurate. Many data-driven parsing approaches developed for natural languages are robust and have quite high accuracy when applied to parsing of software. We show this for the programming languages Java, C/C++, and Python. Further studies indicate that post-processing can almost completely remove the remaining errors. Finally, the training data for instantiating the generic data-driven parser can be generated automatically for formal languages, as opposed to the manually development of treebanks for natural languages. Hence, our approach could improve the robustness of software maintenance tools, probably without showing a significant negative effect on their accuracy.