IEEE Transactions on Pattern Analysis and Machine Intelligence
Image Database Retrieval with Multiple-Instance Learning Techniques
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
The Journal of Machine Learning Research
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Discriminative Training for Object Recognition Using Image Patches
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Learning Object Categories from Google"s Image Search
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Detection of video sequences using compact signatures
ACM Transactions on Information Systems (TOIS)
Large-Scale Concept Ontology for Multimedia
IEEE MultiMedia
The challenge problem for automated detection of 101 semantic concepts in multimedia
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Real-time computerized annotation of pictures
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Motion pattern-based video classification and retrieval
EURASIP Journal on Applied Signal Processing
Multiple Bernoulli relevance models for image and video annotation
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
IEEE Transactions on Circuits and Systems for Video Technology
Identifying relevant frames in weakly labeled videos for training concept detectors
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Detecting pornographic video content by combining image features with motion information
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Learning automatic concept detectors from online video
Computer Vision and Image Understanding
Multimedia Tools and Applications
Reliability and effectiveness of clickthrough data for automatic image annotation
Multimedia Tools and Applications
The Tag Genome: Encoding Community Knowledge to Support Novel Interaction
ACM Transactions on Interactive Intelligent Systems (TiiS) - Special Issue on Common Sense for Interactive Systems
Pornography detection in video benefits (a lot) from a multi-modal approach
Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis
CONTENTUS--technologies for next generation multimedia libraries
Multimedia Tools and Applications
Hi-index | 0.00 |
We present a system that automatically tags videos, i.e. detects high-level semantic concepts like objects or actions in them. To do so, our system does not rely on datasets manually annotated for research purposes. Instead, we propose to use videos from online portals like youtube.com as a novel source of training data, whereas tags provided by users during upload serve as ground truth annotations. This allows our system to learn autonomously by automatically downloading its training set. The key contribution of this work is a number of large-scale quantitative experiments on real-world online videos, in which we investigate the influence of the individual system components, and how well our tagger generalizes to novel content. Our key results are: (1) Fair tagging results can be obtained by a late fusion of several kinds of visual features. (2) Using more than one keyframe per shot is helpful. (3) To generalize to different video content (e.g., another video portal), the system can be adapted by expanding its training set.