Content-Based TV Sports Video Retrieval Based on Audio-Visual Features and Text Information

  • Authors:
  • Liu Huayong

  • Affiliations:
  • Central China Normal University, China

  • Venue:
  • WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose content-based video retrieval, which is a kind of retrieval by its semantical contents. Because video data is composed of multimodal information streams such as visual, auditory and textual streams, we describe a strategy of using multimodal analysis for automatic parsing sports video. The paper first defines the basic structure of sports video database system, and then introduces a new approach that integrates visual streams analysis, speech recognition, speech signal processing and text extraction to realize video retrieval. The experimental results for TV sports video of football games indicate that multimodal analysis is effective for video retrieval by quickly browsing tree-like video clips or inputting keywords within predefined domain.