Parsing Ill-Formed Text Using an Error Grammar

  • Authors:
  • Jennifer Foster;Carl Vogel

  • Affiliations:
  • Computational Linguistics Group, Trinity College, University of Dublin, Dublin 2, Ireland, E-mail: jfoster@tcd.ie;Computational Linguistics Group, Trinity College, University of Dublin, Dublin 2, Ireland

  • Venue:
  • Artificial Intelligence Review
  • Year:
  • 2004

Quantified Score

Hi-index 0.02

Visualization

Abstract

This paper presents a robust parsing approach which is designed to address the issue of syntactic errors in text. The approach is based on the concept of an error grammar which is a grammar of ungrammatical sentences. An error grammar is derived from a conventional grammar on the basis of an analysis of a corpus of observed ill-formed sentences. A robust parsing algorithm is presented which is applied after a conventional bottom–up parsing algorithm has failed. This algorithm combines a rule from the error grammar with rules from the normal grammar to arrive at a parse for an ungrammatical sentence. This algorithm is applied to 50 test sentences, with encouraging results.