Effective transfer tagging from image to video

Authors:
Yang Yang;Yi Yang;Heng Tao Shen
Affiliations:
The University of Queensland, St. Lucia, QLD;Carnegie Mellon University, Pittsburgh, PA;The University of Queensland, St. Lucia, QLD
Venue:
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Year:
2013

Citing 28
Cited 2

Integrating structured biological data by Kernel Maximum Mean Discrepancy

Bioinformatics
Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples

The Journal of Machine Learning Research
Boosting for transfer learning

Proceedings of the 24th international conference on Machine learning
Kodak's consumer video benchmark data set: concept definition and annotation

Proceedings of the international workshop on Workshop on multimedia information retrieval
Cross-domain video concept detection using adaptive svms

Proceedings of the 15th international conference on Multimedia
Structure-sensitive manifold ranking for video concept detection

Proceedings of the 15th international conference on Multimedia
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
The MIR flickr retrieval evaluation

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Semi-supervised kernel density estimation for video annotation

Computer Vision and Image Understanding
Transfer learning via dimensionality reduction

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Visual query suggestion

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Semantic context transfer across heterogeneous sources for domain adaptive video search

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Ranking with local regression and global alignment for cross media retrieval

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Inferring semantic concepts from community-contributed images and noisy tags

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Unified video annotation via multigraph learning

IEEE Transactions on Circuits and Systems for Video Technology
NUS-WIDE: a real-world web image database from National University of Singapore

Proceedings of the ACM International Conference on Image and Video Retrieval
Beyond distance measurement: constructing neighborhood similarity for video annotation

IEEE Transactions on Multimedia - Special section on communities and media computing
L2 regularization for learning kernels

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Visual query suggestion: Towards capturing user intent in internet image search

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Mining multi-tag association for image tagging

World Wide Web
Transfer tagging from image to video

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Video-to-shot tag allocation by weighted sparse group lasso

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Learning heterogeneous data for hierarchical web video classification

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Tag localization with spatial correlations and joint group sparsity

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Video Annotation Based on Kernel Linear Neighborhood Propagation

IEEE Transactions on Multimedia
Towards a Relevant and Diverse Search of Social Images

IEEE Transactions on Multimedia
Interactive Video Indexing With Statistical Active Learning

IEEE Transactions on Multimedia
Robust cross-media transfer for visual event detection

Proceedings of the 20th ACM international conference on Multimedia

Multimedia summarization for trending topics in microblogs

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Learning to Recommend Descriptive Tags for Questions in Social Forums

ACM Transactions on Information Systems (TOIS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent years have witnessed a great explosion of user-generated videos on the Web. In order to achieve an effective and efficient video search, it is critical for modern video search engines to associate videos with semantic keywords automatically. Most of the existing video tagging methods can hardly achieve reliable performance due to deficiency of training data. It is noticed that abundant well-tagged data are available in other relevant types of media (e.g., images). In this article, we propose a novel video tagging framework, termed as Cross-Media Tag Transfer (CMTT), which utilizes the abundance of well-tagged images to facilitate video tagging. Specifically, we build a “cross-media tunnel” to transfer knowledge from images to videos. To this end, an optimal kernel space, in which distribution distance between images and video is minimized, is found to tackle the domain-shift problem. A novel cross-media video tagging model is proposed to infer tags by exploring the intrinsic local structures of both labeled and unlabeled data, and learn reliable video classifiers. An efficient algorithm is designed to optimize the proposed model in an iterative and alternative way. Extensive experiments illustrate the superiority of our proposal compared to the state-of-the-art algorithms.