Integrated Image and Speech Analysis for Content-Based Video Indexing

  • Authors:
  • Affiliations:
  • Venue:
  • ICMCS '96 Proceedings of the 1996 International Conference on Multimedia Computing and Systems
  • Year:
  • 1996

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we study an important problem in multimedia database, namely, the automatic extraction of indexing information from raw data based on video contents. The goal of our research project is to develop a prototype system for automatic indexing of sports videos. The novelty of our work is that we propose to integrate speech understanding and image analysis algorithms for extracting information. The main thrust of this work comes from the observation that in news or sports video indexing, usually speech analysis is more efficient in detecting events than image analysis. Therefore, in our system, the audio processing modules are first applied to locate candidates in the whole data. This information is passed to the video processing modules, which further analyze the video. The final products of video analysis are in the form of pointers to the locations of interesting events in a video. Our algorithms have been tested extensively with real TV programs, and results are presented and discussed in the paper.