Highlight sound effects detection in audio stream

  • Authors:
  • Rui Cai;Lie Lu;Hong-Jiang Zhang;Lian-Hong Cai

  • Affiliations:
  • Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China;Dipt. Sistemi e Informatica, Firenze Univ., Italy;Dipt. Sistemi e Informatica, Firenze Univ., Italy;IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

  • Venue:
  • ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 3 (ICME '03) - Volume 03
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses the problem of highlight sound effects detection in audio stream, which is very useful in fields of video summarization and highlight extraction. Unlike researches on audio segmentation and classification, in this domain, it just locates those highlight sound effects in audio stream. An extensible framework is proposed and in current system three sound effects are considered: laughter, applause and cheer, which are tied up with highlight events in entertainments, sports, meetings and home videos. HMMs are used to model these sound effects and a log-likelihood scores based method is used to make final decision. A sound effect attention model is also proposed to extend general audio attention model for highlight extraction and video summarization. Evaluations on a 2-hours audio database showed very encouraging results.