Content-Based TV Sports Video Retrieval Based on Audio-Visual Features and Text Information

Authors:
Liu Huayong
Affiliations:
Central China Normal University, China
Venue:
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Year:
2004

Abstract

This paper proposes an approach to content-based video retrieval, that is, retrieval of video by its semantic content. Because video data is composed of multimodal information streams (visual, auditory and textual), we describe a strategy that uses multimodal analysis to parse sports video automatically. The paper first defines the basic structure of a sports video database system, and then introduces a new approach that integrates visual stream analysis, speech recognition, speech signal processing and text extraction to realize video retrieval. Experimental results on TV video of football games indicate that multimodal analysis is effective for video retrieval, either by quickly browsing tree-like video clips or by entering keywords within a predefined domain.
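
The two retrieval modes named in the abstract, browsing a tree of clips and keyword lookup, can be illustrated with a minimal Python sketch. The abstract does not describe the paper's actual data structures, so everything here is an assumption made for illustration: the `Clip` class, its fields, the helper functions `walk`, `outline` and `search`, and the sample match data are all hypothetical. The keywords are assumed to have been produced already by the speech and text analysis stages and stored in lower case.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Set, Tuple


@dataclass
class Clip:
    """One node of a hypothetical clip tree (e.g. match -> half -> event)."""
    title: str
    start_s: float                      # clip start, seconds from video start
    end_s: float                        # clip end, seconds from video start
    keywords: Set[str] = field(default_factory=set)   # assumed lower case
    children: List["Clip"] = field(default_factory=list)


def walk(clip: Clip, depth: int = 0) -> Iterator[Tuple[int, Clip]]:
    """Yield (depth, clip) pairs in depth-first order."""
    yield depth, clip
    for child in clip.children:
        yield from walk(child, depth + 1)


def outline(root: Clip) -> List[str]:
    """Return an indented listing of the tree, for browsing."""
    return ["  " * depth + clip.title for depth, clip in walk(root)]


def search(root: Clip, keyword: str) -> List[Clip]:
    """Return every clip whose keyword set contains the query keyword."""
    wanted = keyword.lower()
    return [clip for _, clip in walk(root) if wanted in clip.keywords]


# Invented sample data: one match with two halves and three events.
game = Clip("Match", 0, 5400, children=[
    Clip("First half", 0, 2700, children=[
        Clip("Goal at 12:30", 750, 780, {"goal", "shot"}),
        Clip("Corner kick", 1500, 1520, {"corner"}),
    ]),
    Clip("Second half", 2700, 5400, children=[
        Clip("Penalty", 4000, 4040, {"penalty", "goal"}),
    ]),
])

if __name__ == "__main__":
    print("\n".join(outline(game)))
    for clip in search(game, "goal"):
        print(clip.title, clip.start_s, clip.end_s)
```

In this sketch, browsing corresponds to printing the outline and descending into a branch, while keyword retrieval is a traversal that filters on the stored keyword sets. A system at realistic scale would more likely keep an inverted index from keyword to clip rather than scanning the whole tree on each query.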