AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis
Proceedings of the 20th ACM international conference on Multimedia
This paper investigates the classification of short user-generated videos (UGVs) using their accompanying audio, since short UGVs account for a large proportion of the UGVs on the Internet and many of them are accompanied by single-category soundtracks. We define seven types of UGVs, each corresponding to one of seven audio categories. We investigate three modeling approaches for audio feature representation: single Gaussian (1G), Gaussian mixture model (GMM), and Bag-of-Audio-Word (BoAW). Support Vector Machine (SVM) classifiers are then trained to categorize the UGVs, using a different distance measure for each of the three feature representations. The evaluation shows that these approaches are effective for categorizing short UGVs based on their audio tracks. A GMM representation with the approximated Bhattacharyya distance (ABD) produces the best performance, and a BoAW representation with a chi-square kernel reports comparable results.
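As an illustration of the kind of distance measures named above, the following is a minimal sketch, not the authors' implementation. It assumes the standard closed-form Bhattacharyya distance between two Gaussians (the 1G case) and a common exponential form of the chi-square kernel for BoAW histograms. The approximated Bhattacharyya distance used for GMMs is not reproduced here, because the abstract does not define it. The function names, the `gamma` parameter, and the `eps` smoothing term are hypothetical.

```python
import numpy as np


def bhattacharyya_gaussian(mu1, cov1, mu2, cov2):
    """Closed-form Bhattacharyya distance between two multivariate Gaussians.

    Assumption: each clip's audio features are summarized by one Gaussian
    (mean vector and full covariance matrix).
    """
    mu1 = np.asarray(mu1, dtype=float)
    mu2 = np.asarray(mu2, dtype=float)
    cov1 = np.asarray(cov1, dtype=float)
    cov2 = np.asarray(cov2, dtype=float)

    cov = 0.5 * (cov1 + cov2)  # averaged covariance
    diff = mu1 - mu2

    # Term driven by the difference between the means.
    mean_term = 0.125 * diff @ np.linalg.solve(cov, diff)

    # Term driven by the difference between the covariances,
    # computed with log-determinants for numerical stability.
    _, logdet = np.linalg.slogdet(cov)
    _, logdet1 = np.linalg.slogdet(cov1)
    _, logdet2 = np.linalg.slogdet(cov2)
    cov_term = 0.5 * (logdet - 0.5 * (logdet1 + logdet2))

    return float(mean_term + cov_term)


def chi_square_kernel(h1, h2, gamma=1.0, eps=1e-12):
    """Exponential chi-square kernel between two non-negative histograms.

    Assumption: h1 and h2 are BoAW histograms of equal length; gamma and
    eps are illustrative parameters, not values taken from the paper.
    """
    h1 = np.asarray(h1, dtype=float)
    h2 = np.asarray(h2, dtype=float)
    dist = np.sum((h1 - h2) ** 2 / (h1 + h2 + eps))
    return float(np.exp(-gamma * dist))


# Example: two 2-D Gaussians with identity covariance and means 2 apart.
identity = [[1.0, 0.0], [0.0, 1.0]]
d = bhattacharyya_gaussian([0.0, 0.0], identity, [2.0, 0.0], identity)

# Example: two normalized audio-word histograms.
k = chi_square_kernel([0.5, 0.5, 0.0], [0.25, 0.25, 0.5])
```

One plausible way to use such measures with an SVM is to evaluate them over all pairs of training clips and pass the resulting kernel matrix to a classifier that accepts precomputed kernels. A distance such as `bhattacharyya_gaussian` would first need to be converted to a similarity, for example with `exp(-d)`; whether the paper does this is not stated in the abstract.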