A Comparison of Language Models for Dialog Act Segmentation of Meeting Transcripts

  • Authors:
  • Jáchym Kolář

  • Affiliations:
  • Department of Cybernetics at Faculty of Applied Sciences, University of West Bohemia, Plzeň, Czech Republic CZ-306 14

  • Venue:
  • TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
  • Year:
  • 2008

Quantified Score

Hi-index 0.05

Visualization

Abstract

This paper compares language modeling techniques for dialog act segmentation of multiparty meetings. The evaluation is twofold; we search for a convenient representation of textual information and an efficient modeling approach. The textual features capture word identities, parts-of-speech, and automatically induced classes. The models under examination include hidden event language models, maximum entropy, and BoosTexter. All presented methods are tested using both human-generated reference transcripts and automatic transcripts obtained from a state-of-the-art speech recognizer.