A naive mid-level concept-based fusion approach to violence detection in Hollywood movies

Authors:
Bogdan Ionescu;Jan Schlüter;Ionut Mironica;Markus Schedl
Affiliations:
University Politehnica of Bucharest, Bucharest, Romania;Austrian Research Institute for Artificial Intelligence, Vienna, Austria;University Politehnica of Bucharest, Bucharest, Romania;Johannes Kepler Universität, Linz, Austria
Venue:
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Year:
2013

Citing 16
Cited 1

Person-on-Person Violence Detection in Video Data

ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 1 - Volume 1
Semantic context detection based on hierarchical audio models

MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Automatically selecting shots for action movie trailers

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Semantic video analysis for psychological research on violence in computer games

Proceedings of the 6th ACM international conference on Image and video retrieval
Affective Characterization of Movie Scenes Based on Multimedia Content Analysis and User's Physiological Emotional Responses

ISM '08 Proceedings of the 2008 Tenth IEEE International Symposium on Multimedia
Detecting Violent Scenes in Movies by Auditory and Visual Cues

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
CASSANDRA: audio-video sensor fusion for aggression detection

AVSS '07 Proceedings of the 2007 IEEE Conference on Advanced Video and Signal Based Surveillance
Learning color names for real-world applications

IEEE Transactions on Image Processing
Weakly-Supervised Violence Detection in Movies with Audio and Video Based Co-training

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Pyramidal Multi-level Features for the Robot Vision@ICPR 2010 Challenge

ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Violence Detection in Video Using Spatio-Temporal Features

SIBGRAPI '10 Proceedings of the 2010 23rd SIBGRAPI Conference on Graphics, Patterns and Images
Violence detection in video using computer vision techniques

CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part II
Violence Detection in Movies

CGIV '11 Proceedings of the 2011 Eighth International Conference Computer Graphics, Imaging and Visualization
Audio-Visual fusion for detecting violent scenes in videos

SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
Affective video content representation and modeling

IEEE Transactions on Multimedia
Shot-boundary detection: unraveled and resolved?

IEEE Transactions on Circuits and Systems for Video Technology

Violent scene detection using mid-level feature

Proceedings of the Fourth Symposium on Information and Communication Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we approach the issue of violence detection in typical Hollywood productions. Given the high variability in appearance of violent scenes in movies, training a classifier to predict violent frames directly from visual or/and auditory features seems rather difficult. Instead, we propose a different perspective that relies on fusing mid-level concept predictions that are inferred from low-level features. This is achieved by employing a bank of multi-layer perceptron classifiers featuring a dropout training scheme. Experimental validation conducted in the context of the Violent Scenes Detection task of the MediaEval 2012 Multimedia Benchmark Evaluation show the potential of this approach that ranked first among 34 other submissions in terms of precision and F1-score.