Preprocessing for Unification Parsing of Spoken Language

  • Authors:
  • Mark-Jan Nederhof

  • Affiliations:
  • -

  • Venue:
  • NLP '00 Proceedings of the Second International Conference on Natural Language Processing
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Wordgraphs are structures that may be output by speech recognisers. We discuss various methods for turning wordgraphs into smaller structures. One of these methods is novel; this method relies on a new kind of determinization of acyclic weighted finite automata that is language-preserving but not fully weight-preserving, and results in smaller automata than in the case of traditional determinization of weighted finite automata. We present empirical data comparing the respective methods. The methods are relevant for systems in which wordgraphs form the input to kinds of syntactic analysis that are very time consuming, such as unification parsing.