A Generic Approach for Systematic Analysis of Sports Videos

Authors:
Ning Zhang; Ling-Yu Duan; Lingfang Li; Qingming Huang; Jun Du; Wen Gao; Ling Guan
Affiliations:
Ryerson University; Peking University; Institute of Computing Technology, Chinese Academy of Sciences; Institute of Computing Technology, Chinese Academy of Sciences; NEC Research Labs, China; Peking University; Ryerson University
Venue:
ACM Transactions on Intelligent Systems and Technology (TIST)
Year:
2012

Abstract

A wide range of methods has been proposed for sports video analysis. However, individual works have focused on sophisticated methodologies tailored to particular sport types, and the field has lacked scalable, holistic frameworks. This article presents a systematic and generic approach and evaluates it on a relatively large-scale collection of sports videos. The system targets event detection in an input video through an ordered, sequential process. First, local descriptors that do not depend on domain knowledge are extracted uniformly from the input video sequence. The video representation is then built with a bag-of-visual-words (BoW) model. The video’s genre is identified by applying k-nearest neighbor (k-NN) classifiers to this representation, and various dissimilarity measures are evaluated analytically. Next, an unsupervised probabilistic latent semantic analysis (PLSA)-based approach is applied to the same histogram-based representation, assigning each frame of the video sequence to one of four view groups: close-up-view, mid-view, long-view, and outer-field-view. Finally, a hidden conditional random field (HCRF) structured prediction model is used to detect interesting events. In the experiments, the k-NN classifier with the KL-divergence measure gives the best genre categorization accuracy, 82.16%. For view classification, supervised SVM and unsupervised PLSA reach average accuracies of 82.86% and 68.13%, respectively. The HCRF model achieves 92.31% accuracy with the unsupervised PLSA-based labels as input, comparable to the 93.08% obtained with the supervised SVM-based labels. Such a systematic approach can therefore be applied generically to the processing of large video collections.
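
The genre identification step (k-NN over bag-of-visual-words histograms with a KL-divergence dissimilarity) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the additive smoothing constant, the direction of the divergence (query relative to training histogram), the majority vote, and the toy histograms and genre labels are all assumptions made for illustration only.

```python
import numpy as np

def normalize(hist, eps=1e-10):
    """Turn raw visual-word counts into a smoothed probability distribution.

    The eps smoothing is an assumption; it keeps the KL divergence finite
    when a visual word is absent from one of the histograms.
    """
    h = np.asarray(hist, dtype=float) + eps
    return h / h.sum()

def kl_divergence(p, q):
    """KL(p || q) for two strictly positive distributions of equal length."""
    return float(np.sum(p * np.log(p / q)))

def knn_genre(query_hist, train_hists, train_labels, k=3):
    """Label a query histogram by majority vote among its k nearest
    training histograms, using KL(query || train) as the dissimilarity."""
    q = normalize(query_hist)
    dists = [kl_divergence(q, normalize(h)) for h in train_hists]
    nearest = np.argsort(dists)[:k]
    votes = {}
    for i in nearest:
        votes[train_labels[i]] = votes.get(train_labels[i], 0) + 1
    return max(votes, key=votes.get)

# Hypothetical toy data: 4-word vocabulary, two genres.
train_hists = [[9, 1, 0, 0], [8, 2, 0, 0], [0, 0, 7, 3], [0, 1, 8, 1]]
train_labels = ["soccer", "soccer", "tennis", "tennis"]

predicted = knn_genre([7, 3, 0, 0], train_hists, train_labels, k=3)
print(predicted)
```

KL divergence is asymmetric, so swapping the argument order or using a symmetrized variant may change the neighbor ranking; the abstract does not say which form was used.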