Parallel neural networks for multimodal video genre classification

Authors:
Maurizio Montagnuolo;Alberto Messina
Affiliations:
Department of Computer Science, University of Turin, Turin, Italy 10149;Centre for Research and Technological Innovation, RAI Radiotelevisione Italiana, Turin, Italy 10135
Venue:
Multimedia Tools and Applications
Year:
2009

Citing 21
Cited 11

Learning internal representations by error propagation

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
Color indexing

International Journal of Computer Vision
C4.5: programs for machine learning

C4.5: programs for machine learning
Digital video processing

Digital video processing
The nature of statistical learning theory

The nature of statistical learning theory
Automatic recognition of film genres

Proceedings of the third ACM international conference on Multimedia
Automatic Genre Identification for Content-Based Video Categorization

ICPR '00 Proceedings of the International Conference on Pattern Recognition - Volume 4
Fuzzy Clustering for TV Program Classification

ITCC '04 Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2 - Volume 2
Multimodal Video Indexing: A Review of the State-of-the-art

Multimedia Tools and Applications
Sports video categorizing method using camera motion parameters

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Detecting cartoons: a case study in automatic video-genre classification

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Video classification using spatial-temporal features and PCA

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 3 (ICME '03) - Volume 03
Automatic Sports Video Genre Classification using Pseudo-2D-HMM

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 04
Improving Program Guides for Reducing TV Stream Structuring Problem to a Simple Alignment Problem

CIMCA '06 Proceedings of the International Conference on Computational Inteligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce
Video genre classification using dynamics

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
Automatic Genre Classification of TV Programmes Using Gaussian Mixture Models and Neural Networks

DEXA '07 Proceedings of the 18th International Conference on Database and Expert Systems Applications
Multimedia genre characterisation with fuzzy embedding classifiers

Proceedings of the 2008 Ambi-Sys workshop on Ambient media delivery and interactive television
Multimodal Genre Analysis Applied to Digital Television Archives

DEXA '08 Proceedings of the 2008 19th International Conference on Database and Expert Systems Application
A rough set approach to video genre classification

ACIVS'06 Proceedings of the 8th international conference on Advanced Concepts For Intelligent Vision Systems
Statistical models of video structure for content analysis and characterization

IEEE Transactions on Image Processing
A rule-based video annotation system

IEEE Transactions on Circuits and Systems for Video Technology

Content-based video genre classification using multiple cues

Proceedings of the 3rd international workshop on Automated information extraction in media production
Automatic video genre categorization and event detection techniques on large-scale sports data

Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
Similarity measurement for animation movies

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
A color-action perceptual approach to the classification of animated movies

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
A Generic Approach for Systematic Analysis of Sports Videos

ACM Transactions on Intelligent Systems and Technology (TIST)
A contour-color-action approach to automatic classification of several common video genres

AMR'10 Proceedings of the 8th international conference on Adaptive Multimedia Retrieval: context, exploration, and fusion
Content-based video description for automatic video genre categorization

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Discovering hot topics from geo-tagged video

Neurocomputing
Multimodal genre classification of TV programs and YouTube videos

Multimedia Tools and Applications
Who produced this video, amateur or professional?

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Improvements in digital technology have made possible the production and distribution of huge quantities of digital multimedia data. Tools for high-level multimedia documentation are becoming indispensable to efficiently access and retrieve desired content from such data. In this context, automatic genre classification provides a simple and effective solution to describe multimedia contents in a structured and well understandable way. We propose in this article a methodology for classifying the genre of television programmes. Features are extracted from four informative sources, which include visual-perceptual information (colour, texture and motion), structural information (shot length, shot distribution, shot rhythm, shot clusters duration and saturation), cognitive information (face properties, such as number, positions and dimensions) and aural information (transcribed text, sound characteristics). These features are used for training a parallel neural network system able to distinguish between seven video genres: football, cartoons, music, weather forecast, newscast, talk show and commercials. Experiments conducted on more than 100 h of audiovisual material confirm the effectiveness of the proposed method, which reaches a classification accuracy rate of 95%.