Thematic segmentation of meetings through document/speech alignment

  • Authors:
  • Dalila Mekhaldi;Denis Lalanne;Rolf Ingold

  • Affiliations:
  • DIVA/DIUF, Fribourg, Switzerland;DIVA/DIUF, Fribourg, Switzerland;DIVA/DIUF, Fribourg, Switzerland

  • Venue:
  • Proceedings of the 12th annual ACM international conference on Multimedia
  • Year:
  • 2004

Abstract

This article proposes a multimodal approach for segmenting meeting recordings. This bi-modal method takes advantage of the alignment of the speech transcript with documents, in the context of meetings or lectures where documents are discussed. The method first displays the alignment results as a set of nodes in a 2D space, where the two axes represent the document content and the speech transcript, respectively. The most densely connected regions in this graph are detected using a clustering method. The final clusters are then projected onto the speech axis. Finally, the resulting sequence of segments is taken as the thematic structure of the speech transcript. In this article, we present our bi-modal method and compare it with two other mono-modal thematic segmentation methods.
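
The pipeline described in the abstract (alignment nodes in a 2D space, clustering, projection onto the speech axis) can be illustrated with a minimal Python sketch. The abstract does not name the clustering method, so the sketch substitutes a simple connected-components grouping; it is not the authors' algorithm. The function names `cluster_nodes` and `project_on_speech_axis`, the parameters `max_gap` and `min_size`, and the sample alignment pairs are all hypothetical and chosen only for illustration.

```python
from collections import deque


def cluster_nodes(nodes, max_gap=1):
    """Group alignment nodes into connected components.

    Each node is a (document_index, speech_index) pair. Two nodes are
    linked when both coordinates differ by at most max_gap. This linking
    rule is an assumption, standing in for the unspecified clustering method.
    """
    unvisited = set(nodes)
    clusters = []
    while unvisited:
        seed = unvisited.pop()
        queue = deque([seed])
        cluster = [seed]
        while queue:
            d, s = queue.popleft()
            neighbours = [
                n for n in unvisited
                if abs(n[0] - d) <= max_gap and abs(n[1] - s) <= max_gap
            ]
            for n in neighbours:
                unvisited.remove(n)
                queue.append(n)
                cluster.append(n)
        clusters.append(cluster)
    return clusters


def project_on_speech_axis(clusters, min_size=2):
    """Project clusters onto the speech axis as (start, end) intervals.

    Clusters smaller than min_size are treated as noise and dropped
    (an assumption). Overlapping intervals are merged so the result is
    a sequence of disjoint speech segments.
    """
    intervals = sorted(
        (min(s for _, s in c), max(s for _, s in c))
        for c in clusters
        if len(c) >= min_size
    )
    segments = []
    for start, end in intervals:
        if segments and start <= segments[-1][1]:
            # Overlaps the previous segment: extend it instead of adding a new one.
            segments[-1] = (segments[-1][0], max(segments[-1][1], end))
        else:
            segments.append((start, end))
    return segments


# Hypothetical alignment: (document block index, speech utterance index).
alignment = [(0, 0), (0, 1), (1, 2), (3, 5), (3, 6), (4, 7), (9, 3)]
segments = project_on_speech_axis(cluster_nodes(alignment))
print(segments)  # [(0, 2), (5, 7)]
```

In this toy input, the isolated node `(9, 3)` forms a cluster of size one and is discarded, leaving two thematic segments that cover utterances 0 to 2 and 5 to 7. A density-based or hierarchical clustering method could replace `cluster_nodes` without changing the projection step.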