Enriching and localizing semantic tags in internet videos

Authors:
Lamberto Ballan;Marco Bertini;Alberto Del Bimbo;Giuseppe Serra
Affiliations:
University of Florence, Florence, Italy;University of Florence, Florence, Italy;University of Florence, Florence, Italy;University of Florence, Florence, Italy
Venue:
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Year:
2011

Abstract

Tagging of multimedia content is becoming increasingly widespread as Web 2.0 sites, such as Flickr and Facebook for images and YouTube and Vimeo for videos, have popularized tagging functionalities among their users. These user-generated tags are used to retrieve multimedia content and to ease browsing and exploration of media collections, e.g. using tag clouds. However, not all media are tagged equally by users: with current browsers it is easy to tag a single photo, and even tagging a part of a photo, such as a face, has become common on sites like Flickr and Facebook. Tagging a video sequence, on the other hand, is more complicated and time-consuming, so users tag only the overall content of a video. In this paper we present a system for automatic video annotation that increases the number of tags originally provided by users and localizes them temporally, associating tags with shots. This approach exploits the collective knowledge embedded in tags and Wikipedia, and the visual similarity between keyframes and images uploaded to social sites like YouTube and Flickr.
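
The abstract does not give algorithmic details, so the following is only an illustrative sketch of how tags might be localized to shots through visual similarity. It is not the authors' method. It assumes each shot is represented by one keyframe feature vector, that a pool of tagged images with comparable feature vectors is available, and that a set of candidate tags (the user's original tags plus any expanded ones) has already been built. The names `cosine`, `localize_tags`, `k`, and `min_votes` are hypothetical, and the simple neighbor-voting rule is a stand-in technique chosen for brevity.

```python
from collections import Counter
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def localize_tags(keyframes, tagged_images, candidate_tags, k=3, min_votes=2):
    """Assign candidate tags to shots by voting among visually similar images.

    keyframes:      dict mapping shot id -> keyframe feature vector (assumed)
    tagged_images:  list of (feature vector, set of tags) pairs (assumed)
    candidate_tags: tags that may be assigned to a shot (assumed)
    """
    result = {}
    for shot_id, feat in keyframes.items():
        # Keep the k images most similar to this shot's keyframe.
        neighbours = sorted(
            tagged_images, key=lambda img: cosine(feat, img[0]), reverse=True
        )[:k]
        # Each neighbour votes once for each of its tags that is a candidate.
        votes = Counter(
            tag for _, tags in neighbours for tag in tags if tag in candidate_tags
        )
        # A tag is attached to the shot only if enough neighbours agree on it.
        result[shot_id] = sorted(t for t, n in votes.items() if n >= min_votes)
    return result


# Toy data: two shots and four tagged images in a 2-D feature space.
keyframes = {"shot_1": [1.0, 0.0], "shot_2": [0.0, 1.0]}
tagged_images = [
    ([0.9, 0.1], {"beach", "sea"}),
    ([0.8, 0.2], {"beach"}),
    ([0.1, 0.9], {"city", "night"}),
    ([0.2, 0.8], {"city"}),
]
candidate_tags = {"beach", "sea", "city", "night"}

shot_tags = localize_tags(keyframes, tagged_images, candidate_tags, k=2, min_votes=2)
print(shot_tags)
```

With this toy data, each shot receives only the tag on which both of its nearest images agree, which illustrates how a tag given to a whole video could end up attached to individual shots. A real system would need shot detection, robust visual features, and a principled tag expansion step, none of which are specified in the abstract.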