Video genre classification using dynamics
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
Fuzzy color-based approach for understanding animated movies content in the indexing task
Journal on Image and Video Processing - Color in Image and Video Processing
Parallel neural networks for multimodal video genre classification
Multimedia Tools and Applications
An Approach to the Parameterization of Structure for Fast Categorization
International Journal of Computer Vision
Automatic Video Classification: A Survey of the Literature
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
The MPEG-7 visual standard for content description-an overview
IEEE Transactions on Circuits and Systems for Video Technology
Cross-modal categorisation of user-generated video sequences
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Short user-generated videos classification using accompanied audio categories
Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis
Who produced this video, amateur or professional?
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Hi-index | 0.00 |
In this paper, we propose an audio-visual approach to video genre categorization. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At temporal structural level, we asses action contents with respect to human perception. Further, color perception is quantified with statistics of color distribution, elementary hues, color properties and relationship of color. The last category of descriptors determines statistics of contour geometry. An extensive evaluation of this multi-modal approach based on on more than 91 hours of video footage is presented. We obtain average precision and recall ratios within [87%−100%] and [77%−100%], respectively, while average correct classification is up to 97%. Additionally, movies displayed according to feature-based coordinates in a virtual 3D browsing environment tend to regroup with respect to genre, which has potential application with real content-based browsing systems.