IDL-expressions: a formalism for representing and parsing finite languages in natural language processing

  • Authors:
  • Mark-Jan Nederhof;Giorgio Satta

  • Affiliations:
  • Faculty of Arts, University of Groningen, Groningen, The Netherlands;Dept. of Information Engineering, University of Padua, Padova, Italy

  • Venue:
  • Journal of Artificial Intelligence Research
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a formalism for representation of finite languages, referred to as the class of IDL-expressions, which combines concepts that were only considered in isolation in existing formalisms. The suggested applications are in natural language processing, more specifically in surface natural language generation and in machine translation, where a sentence is obtained by first generating a large set of candidate sentences, represented in a compact way, and then filtering such a set through a parser. We study several formal properties of IDL-expressions and compare this new formalism with more standard ones. We also present a novel parsing algorithm for IDL-expressions and prove a non-trivial upper bound on its time complexity.