A Controlled Skip Parser

  • Authors: Machine Translation staff

  • Affiliations: -

  • Venue: Machine Translation
  • Year: 1998

Abstract

Real-world natural language sentences are often long and complex, and contain unexpected grammatical constructions. They even include noise and ungrammaticality. This paper describes the Controlled Skip Parser, a program that parses such real-world sentences by skipping some of the words in the sentence. The new feature of this parser is that it controls its behavior by finding out which words to skip, without using domain-specific knowledge. The parser is a priority-based chart parser. By assigning appropriate priority levels to the constituents in the chart, the parser's behavior is controlled. Statistical information is used for assigning priority levels. The statistical information (n-grams) can be thought of as a generalized approximation of the grammar learned from past successful experiences. The control mechanism gives a great speed-up and reduction in memory usage. Experiments on real newspaper articles are shown, and our experience with this parser in a machine translation system is described.
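
The general idea of a priority-based chart parser that may skip words can be illustrated with a minimal sketch. This is not the paper's implementation: the toy grammar, the lexicon, the bigram scores over categories, the fixed skip penalty, and the way these are combined into a cost are all assumptions made for illustration. Constituents wait on an agenda ordered by cost (lowest cost means highest priority); a constituent may absorb an adjacent word by skipping it, at a penalty.

```python
import heapq
from itertools import count

# Hypothetical toy grammar and lexicon (assumptions, not from the paper).
RULES = {("Det", "N"): "NP", ("V", "NP"): "VP", ("NP", "VP"): "S"}
LEXICON = {"the": "Det", "dog": "N", "cat": "N", "chased": "V"}
# Hypothetical bigram scores over adjacent categories, standing in for
# statistics gathered from earlier successful parses.
BIGRAM = {("Det", "N"): 0.9, ("V", "NP"): 0.8, ("NP", "VP"): 0.9}
SKIP_PENALTY = 1.0  # assumed cost of skipping one word


def parse(words, goal="S"):
    """Return (cost, skipped_word_indices) for the first goal constituent
    that spans the whole input, or None if there is none."""
    n = len(words)
    tie = count()  # unique tiebreaker so the heap never compares further
    agenda = []

    def push(cost, start, end, cat, skipped):
        heapq.heappush(agenda, (cost, next(tie), start, end, cat, skipped))

    for i, word in enumerate(words):
        push(0.0, i, i + 1, LEXICON.get(word, "Unknown"), ())

    chart = {}  # (start, end, category) -> (cost, skipped indices)
    while agenda:
        cost, _, start, end, cat, skipped = heapq.heappop(agenda)
        if (start, end, cat) in chart:
            continue  # a constituent at least as cheap is already recorded
        chart[(start, end, cat)] = (cost, skipped)
        if cat == goal and start == 0 and end == n:
            return cost, skipped

        # Skip moves: absorb one neighbouring word without analysing it.
        if end < n:
            push(cost + SKIP_PENALTY, start, end + 1, cat, skipped + (end,))
        if start > 0:
            push(cost + SKIP_PENALTY, start - 1, end, cat, (start - 1,) + skipped)

        # Combine with adjacent constituents already in the chart.
        for (s2, e2, cat2), (cost2, skipped2) in list(chart.items()):
            if e2 == start and (cat2, cat) in RULES:
                step = 1.0 - BIGRAM.get((cat2, cat), 0.0)
                push(cost2 + cost + step, s2, end, RULES[(cat2, cat)],
                     skipped2 + skipped)
            if s2 == end and (cat, cat2) in RULES:
                step = 1.0 - BIGRAM.get((cat, cat2), 0.0)
                push(cost + cost2 + step, start, e2, RULES[(cat, cat2)],
                     skipped + skipped2)
    return None


result = parse("the dog uh chased the cat".split())
clean = parse("the dog chased the cat".split())
```

In this sketch, `result[1]` lists the positions of the words that were skipped (here the unknown word "uh"), while `clean[1]` is empty because the second sentence parses without skipping. Because cheap constituents are expanded first, expensive skip moves are only explored when nothing cheaper leads to a full parse, which is one simple way a priority ordering can limit the work a chart parser does.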