E-LAMP: integration of innovative ideas for multimedia event detection

  • Authors:
  • Wei Tong, Yi Yang, Lu Jiang, Shoou-I Yu, Zhenzhong Lan, Zhigang Ma, Waito Sze, Ehsan Younessian, Alexander G. Hauptmann

  • Affiliations:
  • Language Technologies Institute, Carnegie Mellon University, Pittsburgh, USA (all authors except Zhigang Ma); Department of Information Engineering and Computer Science, University of Trento, Trento, Italy (Zhigang Ma)

  • Venue:
  • Machine Vision and Applications
  • Year:
  • 2014


Abstract

Detecting multimedia events in web videos is an emerging research area in multimedia and computer vision. In this paper, we introduce the core methods and technologies of the framework developed for our Event Labeling through Analytic Media Processing (E-LAMP) system, which addresses different aspects of the overall event detection problem. First, we have developed efficient feature extraction methods that allow us to handle large collections of video data comprising thousands of hours of video. Second, we represent the extracted raw features in a spatial bag-of-words model with more effective tilings, so that the spatial layout of different features and events is better captured and the overall detection performance improves. Third, unlike the widely used early and late fusion schemes, a novel algorithm is developed to learn a more robust and discriminative intermediate feature representation from multiple features, so that better event models can be built upon it. Finally, to tackle the additional challenge of event detection with only very few positive exemplars, we have developed a novel algorithm that effectively adapts knowledge learnt from auxiliary sources to assist event detection. Both our empirical results and the official evaluation results on TRECVID MED'11 and MED'12 demonstrate the excellent performance of the integration of these ideas.
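The spatial bag-of-words idea mentioned in the abstract can be illustrated with a minimal sketch: local descriptors are quantized to visual-word ids, and a separate word histogram is built per spatial cell of a tiling, then the cells are concatenated. This is only an assumed illustration of the general technique, not the authors' implementation; the function name, grid layout, and normalization are all hypothetical.

```python
import numpy as np

def spatial_bow(keypoints, word_ids, vocab_size, grid=(2, 2)):
    """Concatenated per-cell visual-word histograms (a generic spatial BoW).

    keypoints : (N, 2) array of (x, y) positions normalized to [0, 1).
    word_ids  : (N,) array of visual-word ids in [0, vocab_size).
    grid      : (rows, cols) spatial tiling of the frame (hypothetical choice).
    Returns a vector of length rows * cols * vocab_size.
    """
    rows, cols = grid
    hist = np.zeros((rows, cols, vocab_size))
    for (x, y), wid in zip(keypoints, word_ids):
        # Map the point to its spatial cell, clamping points on the far edge.
        r = min(int(y * rows), rows - 1)
        c = min(int(x * cols), cols - 1)
        hist[r, c, wid] += 1
    return hist.reshape(-1)

# Two keypoints in opposite corners with different visual words end up
# in different cells of the 2x2 tiling, so the layout is preserved.
kps = np.array([[0.1, 0.1], [0.9, 0.9]])
words = np.array([0, 1])
vec = spatial_bow(kps, words, vocab_size=2)
```

A plain (non-spatial) bag-of-words would collapse all cells into one histogram and lose where each word occurred; the tiling is what lets the model distinguish, say, an object in the top of the frame from the same object at the bottom.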