Major cast detection in video using both audio and visual information

  • Authors:
  • Zhu Liu;Yao Wang

  • Affiliations:
  • AT&T Labs - Research, Middletown, NJ, USA;-

  • Venue:
  • ICASSP '01: Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 03
  • Year:
  • 2001

Abstract

Major casts, such as the anchor persons or reporters in news broadcast programs and the principal characters in movies, play an important role in video, and their occurrences provide good indices for organizing and presenting video content. This paper describes a new approach for automatically generating the list of major casts in a video sequence based on multiple modalities, specifically both speaker and face information. A list of major casts is created and ordered by the cumulative temporal and spatial presence of the corresponding casts. Preliminary simulation results show that the detected major casts are meaningful and that the proposed approach is promising.
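
The ordering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual scoring formula, so everything here is an assumption made for illustration: the `Appearance` record, its field names, the `rank_major_casts` function, and the rule of adding speaking time to face time weighted by face size. The sketch also assumes that speaker and face detections have already been grouped under one cast label.

```python
from collections import defaultdict
from typing import NamedTuple


class Appearance(NamedTuple):
    """One segment in which a cast is heard and/or seen (hypothetical record)."""
    cast_id: str           # label assumed to come from joint speaker/face grouping
    speech_seconds: float  # time the cast is speaking in this segment
    face_seconds: float    # time the cast's face is on screen in this segment
    face_area: float       # mean fraction of the frame covered by the face (0..1)


def rank_major_casts(appearances, top_k=3, spatial_weight=1.0):
    """Order casts by accumulated presence (assumed scoring, not the paper's).

    Temporal presence: total speaking time.
    Spatial presence: on-screen time weighted by relative face size.
    """
    temporal = defaultdict(float)
    spatial = defaultdict(float)
    for a in appearances:
        temporal[a.cast_id] += a.speech_seconds
        spatial[a.cast_id] += a.face_seconds * a.face_area

    # Combine the two accumulators into one score per cast.
    scores = {
        cast: temporal[cast] + spatial_weight * spatial[cast]
        for cast in temporal
    }
    # Highest accumulated presence first.
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_k]


if __name__ == "__main__":
    # Made-up segments for a short news clip, for illustration only.
    demo = [
        Appearance("anchor", 120.0, 100.0, 0.25),
        Appearance("reporter", 40.0, 30.0, 0.10),
        Appearance("anchor", 60.0, 50.0, 0.20),
        Appearance("guest", 15.0, 10.0, 0.05),
    ]
    for cast, score in rank_major_casts(demo):
        print(cast, round(score, 2))
```

The `spatial_weight` parameter is likewise hypothetical; it only shows that the balance between audio and visual evidence is a design choice that the abstract leaves open.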