Optimizing multi-graph learning: towards a unified video annotation scheme

Authors:
Meng Wang;Xian-Sheng Hua;Xun Yuan;Yan Song;Li-Rong Dai
Affiliations:
University of Science and Technology of China, Hefei, China;Microsoft Research Asia, Beijing, China;University of Science and Technology of China, Hefei, China;University of Science and Technology of China, Hefei, China;University of Science and Technology of China, Hefei, China
Venue:
Proceedings of the 15th international conference on Multimedia
Year:
2007

Citing 22
Cited 34

Toward Improved Ranking Metrics

IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
User-trainable video annotation using multimodal cues

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The combination limit in multimedia retrieval

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Manifold-ranking based image retrieval

Proceedings of the 12th annual ACM international conference on Multimedia
Optimal multimodal fusion for multimedia data analysis

Proceedings of the 12th annual ACM international conference on Multimedia
On the detection of semantic concepts at TRECVID

Proceedings of the 12th annual ACM international conference on Multimedia
Semi-Supervised Cross Feature Learning for Semantic Concept Detection in Videos

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Hidden Markov models for automatic annotation and content-based retrieval of images and video

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Mining multimedia salient concepts for incremental information extraction

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Early versus late fusion in semantic video analysis

Proceedings of the 13th annual ACM international conference on Multimedia
Graph based multi-modality learning

Proceedings of the 13th annual ACM international conference on Multimedia
Early versus late fusion in semantic video analysis

Proceedings of the 13th annual ACM international conference on Multimedia
Semi-automatic video annotation based on active learning with multiple complementary predictors

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Semi-supervised learning with graphs

Semi-supervised learning with graphs
Content-based video retrieval: does video's semantic visual feature matter?

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Toward Robust Distance Metric Analysis for Similarity Estimation

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Exploring temporal consistency for video analysis and retrieval

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Manifold-ranking based video concept detection on large database and feature pool

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Automatic video annotation by semi-supervised learning with kernel density estimation

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Multiple Bernoulli relevance models for image and video annotation

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Lessons for the future from a decade of informedia video analysis research

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval

Annotating personal albums via web mining

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Integrated graph-based semi-supervised multiple/single instance learning framework for image annotation

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Transductive multi-label learning for video concept detection

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Semi-supervised kernel density estimation for video annotation

Computer Vision and Image Understanding
Concept-Based Video Retrieval

Foundations and Trends in Information Retrieval
Unified video annotation via multigraph learning

IEEE Transactions on Circuits and Systems for Video Technology
Beyond distance measurement: constructing neighborhood similarity for video annotation

IEEE Transactions on Multimedia - Special section on communities and media computing
Tensor-based transductive learning for multimodality video semantic concept detection

IEEE Transactions on Multimedia
Semi-supervised bilinear subspace learning

IEEE Transactions on Image Processing
Multigraph-based query-independent learning for video search

IEEE Transactions on Circuits and Systems for Video Technology
Personalized online video recommendation by neighborhood score propagation based global ranking

Proceedings of the First International Conference on Internet Multimedia Computing and Service
Evolutionary cross-domain discriminative hessian eigenmaps

IEEE Transactions on Image Processing
Multiview spectral embedding

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Semi-automatic flickr group suggestion

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Fusing heterogeneous modalities for video and image re-ranking

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
A transductive multi-label learning approach for video concept detection

Pattern Recognition
Multi-layer graph-based semi-supervised learning for large-scale image datasets using mapreduce

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Difficulty guided image retrieval using linear multiview embedding

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Automatic tag generation and ranking for sensor-rich outdoor videos

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Exploiting the entire feature space with sparsity for automatic image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Social image annotation via cross-domain subspace learning

Multimedia Tools and Applications
Image classification by multimodal subspace learning

Pattern Recognition Letters
Multi-graph multi-instance learning for object-based image and video retrieval

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Classifier-specific intermediate representation for multimedia tasks

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Exploring multi-modality structure for cross domain adaptation in video concept annotation

Neurocomputing
Local goemetrical feature with spatial context for shape-based 3D model retrieval

EG 3DOR'12 Proceedings of the 5th Eurographics conference on 3D Object Retrieval
Knowledge adaptation for ad hoc multimedia event detection with few exemplars

Proceedings of the 20th ACM international conference on Multimedia
Interactive social group recommendation for Flickr photos

Neurocomputing
Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets

Journal of Visual Communication and Image Representation
Local 3d symmetry for visual saliency in 2.5d point clouds

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Web page and image semi-supervised classification with heterogeneous information fusion

Journal of Information Science
Video content categorization using the double decomposition

Multimedia Tools and Applications
Multi-view semi-supervised web image classification via co-graph

Neurocomputing
Multiview Hessian discriminative sparse coding for image annotation

Computer Vision and Image Understanding

Quantified Score

Hi-index	0.00

Visualization

Abstract

Learning based semantic video annotation is a promising approach for enabling content-based video search. However, severe difficulties, such as insufficiency of training data and curse of dimensionality, are frequently encountered. This paper proposes a novel unified scheme, Optimized Multi-Graph-based Semi-Supervised Learning (OMG-SSL), to simultaneously attack these difficulties. Instead of only using a single graph, OMG-SSL integrates multiple graphs into a regularization and optimization framework to sufficiently explore their complementary nature. We then show that various crucial factors in video annotation, including multiple modalities, multiple distance metrics, and temporal consistency, in fact all correspond to different correlations among samples, and hence they can be represented by different graphs. Therefore, OMG-SSL is able to simultaneously deal with these factors within a unified framework. Experiments on the TRECVID benchmark demonstrate the effectiveness of our proposed approach.