Robust data oriented spoken language understanding

  • Authors: Khalil Sima'an
  • Affiliations: Computational Linguistics, University of Amsterdam, Amsterdam, The Netherlands
  • Venue: New developments in parsing technology
  • Year: 2004

Abstract

Spoken utterances do not always abide by linguistically motivated grammatical rules. They exhibit various phenomena that are considered outside the realm of theoretically oriented linguistic research. For a language model that extends linguistically motivated grammars with probabilistic reasoning, the problem is how to achieve the robustness that speech understanding requires. This paper addresses the robustness of the Data Oriented Parsing (DOP) model within a Dutch speech-based dialogue system. It presents an extension of the DOP model into a head-driven variant, which allows for Markovian generation of parse trees. It is shown empirically that the new variant improves over the original DOP model on two tasks: the formal understanding of speech utterances, and the extraction of semantic concepts from word lattices output by a speech recognizer.