Combining lexical, syntactic and prosodic cues for improved online dialog act tagging

  • Authors:
  • Vivek Kumar Rangarajan Sridhar;Srinivas Bangalore;Shrikanth Narayanan

  • Affiliations:
  • Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClilntock Avenue, Room EEB430, Los Angeles, CA 90089-2564, United States;AT&T Labs - Research 180 Park Avenue, Florham Park, NJ 07932, United States;Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClilntock Avenue, Room EEB430, Los Angeles, CA 90089-2564, United States

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Prosody is an important cue for identifying dialog acts. In this paper, we show that modeling the sequence of acoustic-prosodic values as n-gram features with a maximum entropy model for dialog act (DA) tagging can perform better than conventional approaches that use coarse representation of the prosodic contour through summative statistics of the prosodic contour. The proposed scheme for exploiting prosody results in an absolute improvement of 8.7% over the use of most other widely used representations of acoustic correlates of prosody. The proposed scheme is discriminative and exploits context in the form of lexical, syntactic and prosodic cues from preceding discourse segments. Such a decoding scheme facilitates online DA tagging and offers robustness in the decoding process, unlike greedy decoding schemes that can potentially propagate errors. Our approach is different from traditional DA systems that use the entire conversation for offline dialog act decoding with the aid of a discourse model. In contrast, we use only static features and approximate the previous dialog act tags in terms of lexical, syntactic and prosodic information extracted from previous utterances. Experiments on the Switchboard-DAMSL corpus, using only lexical, syntactic and prosodic cues from three previous utterances, yield a DA tagging accuracy of 72% compared to the best case scenario with accurate knowledge of previous DA tags (oracle), which results in 74% accuracy.