Semantic indexing of soccer audio-visual sequences: a multimodal approach based on controlled Markov chains

Authors:
R. Leonardi;P. Migliorati;M. Prandini
Affiliations:
Dept. of Electron. for Autom., Univ. of Brescia, Italy;-;-
Venue:
IEEE Transactions on Circuits and Systems for Video Technology
Year:
2004

Citing 0
Cited 24

A probabilistic template-based approach to discovering repetitive patterns in broadcast videos

Proceedings of the 13th annual ACM international conference on Multimedia
Exciting event detection in broadcast soccer video with mid-level description and incremental learning

Proceedings of the 13th annual ACM international conference on Multimedia
Attention-based video summarisation in rushes collection

Proceedings of the international workshop on TRECVID video summarization
Audio keywords generation for sports video analysis

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Explicit semantic events detection and development of realistic applications for broadcasting baseball videos

Multimedia Tools and Applications
Attention guided football video content recommendation on mobile devices

MobiMedia '06 Proceedings of the 2nd international conference on Mobile multimedia communications
An instant semantics acquisition system of live soccer video with application to live event alert and on-the-fly language selection

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Automatic soccer players tracking in goal scenes by camera motion elimination

Image and Vision Computing
A framework for flexible summarization of racquet sports video using multiple modalities

Computer Vision and Image Understanding
General Highlight Detection in Sport Videos

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Interactive broadcast services for live soccer video based on instant semantics acquisition

Journal of Visual Communication and Image Representation
Players Clustering Based on Graph Theory for Tactics Analysis Purpose in Soccer Videos

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
A visual system for real time detection of goal events during soccer matches

Computer Vision and Image Understanding
An intelligent strategy for the automatic detection of highlights in tennis video recordings

Expert Systems with Applications: An International Journal
Personalized retrieval of sports video based on multi-modal analysis and user preference acquisition

Multimedia Tools and Applications
Semantic concept mining in cricket videos for automated highlight generation

Multimedia Tools and Applications
A review of vision-based systems for soccer video analysis

Pattern Recognition
Modeling spatiotemporal relationships between moving objects for event tactics analysis in tennis videos

Multimedia Tools and Applications
Knowledge-discounted event detection in sports video

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans - Special issue on model-based diagnostics
Bayesian belief network based broadcast sports video indexing

Multimedia Tools and Applications
A template-based baseball video scene classification using efficient playfield segmentation

Multimedia Tools and Applications
Football video segmentation based on video production strategy

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
A video summarization method for basketball game

PCM'05 Proceedings of the 6th Pacific-Rim conference on Advances in Multimedia Information Processing - Volume Part I
Multimedia Databases and Data Management: A Survey

International Journal of Multimedia Data Engineering & Management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Content characterization of sport videos is a subject of great interest to researchers working on the analysis of multimedia documents. In this paper, we propose a semantic indexing algorithm which uses both audio and visual information for salient event detection in soccer. The video signal is processed first by extracting low-level visual descriptors directly from an MPEG-2 bit stream. It is assumed that any instance of an event of interest typically affects two consecutive shots and is characterized by a different temporal evolution of the visual descriptors in the two shots. This motivates the introduction of a controlled Markov chain to describe such evolution during an event of interest, with the control input modeling the occurrence of a shot transition. After adequately training different controlled Markov chain models, a list of video segments can be extracted to represent a specific event of interest using the maximum likelihood criterion. To reduce the presence of false alarms, low-level audio descriptors are processed to order the candidate video segments in the list so that those associated to the event of interest are likely to be found in the very first positions. We focus in particular on goal detection, which represents a key event in a soccer game, using camera motion information as a visual cue and the "loudness" as an audio descriptor. The experimental results show the effectiveness of the proposed multimodal approach.