Multi-level information and automatic dialog act detection in human-human spoken dialogs

  • Authors:
  • S. Rosset;D. Tribout;L. Lamel

  • Affiliations:
  • Spoken Language Processing Group, LIMSI-CNRS, F-91403 Orsay Cedex, BP 133, France;Spoken Language Processing Group, LIMSI-CNRS, F-91403 Orsay Cedex, BP 133, France;Spoken Language Processing Group, LIMSI-CNRS, F-91403 Orsay Cedex, BP 133, France

  • Venue:
  • Speech Communication
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper reports studies on annotating and automatically detecting dialog acts in human-human spoken dialogs. The work reposes on three hypotheses: first, the succession of dialog acts is strongly constrained; second, the initial word and semantic class of word are more important for identifying dialog acts than the complete exact word sequence of an utterance; third, most of the important information is encoded in specific entities. A memory based learning approach is used to detect dialog acts. For each utterance unit, eight dialog acts are systematically annotated. Experiments have been conducted using different levels of information, with and without the use of dialog history information. In order to assess the generality of the method, the specific entity tag based model trained on a French corpus was tested on an English corpus for a similar task and on a French corpus from a different domain. A correct dialog act detection rate of about 86% is obtained for the same domain/language condition and 77% for the cross-language or cross-domain conditions.