An effective news anchorperson shot detection method based on adaptive audio/visual model generation

  • Authors:
  • Sang-Kyun Kim;Doo Sun Hwang;Ji-Yeun Kim;Yang-Seock Seo

  • Affiliations:
  • Computing Lab., Digital Research Center, Samsung A.I.T., Yongin-si, Republic of Korea;Computing Lab., Digital Research Center, Samsung A.I.T., Yongin-si, Republic of Korea;Computing Lab., Digital Research Center, Samsung A.I.T., Yongin-si, Republic of Korea;Computing Lab., Digital Research Center, Samsung A.I.T., Yongin-si, Republic of Korea

  • Venue:
  • CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

A multi-modal method to improve the performance of the anchorperson shot detection for news story segmentation is proposed in this paper. The anchorperson voice information is used for the verification of anchorperson shot candidates extracted by visual information. The algorithm starts with the anchorperson voice shot candidate extraction using time and silence condition. The anchorperson templates are generated from the anchorperson face and cloth information from the anchorperson voice shots extracted. The anchorperson voice models are then created after segregating anchorperson voice shots containing 2 or more voices. The anchorperson voice model verifies the anchorperson shot candidates obtained from visual information. 720 minutes of news programs are tested and experimental results are demonstrated.