New approaches to audio-visual segmentation of TV news for automatic topic retrieval

  • Authors:
  • U. Iurgel;R. Meermeier;S. Eickeler;G. Rigoll

  • Affiliations:
  • Dept. of Comput. Sci., Gerhard-Mercator-University Duisburg, Germany;-;-;-

  • Venue:
  • ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents two new real-time approaches to segmentation of TV news shows into topics. The goal of this research work is the high precision retrieval of topics from TV news. For that purpose, the detection of correct topic boundaries is of great importance. We introduce a stochastic and a rule-based topic model based on HMM. The former combines features from the visual as well as from the audio channel of the news show, whereas the latter uses the video channel only. They are compared to the detection of topics using only the audio channel, which is common for many other approaches. The paper contains the following innovations: (1) the detected segment boundaries correspond directly to topics and not to video or audio cuts, as in most other segmentation methods; (2) an advanced stochastic topic model is introduced that uses audio as well as video features; (3) the introduced HMM-based approaches both outperform the audio-based approach. One algorithm has a very good topic boundary detection rate, whereas the other minimizes the number of wrongly inserted boundaries without missing too many real boundaries.