Unified video annotation via multigraph learning

Authors:
Meng Wang;Xian-Sheng Hua;Richang Hong;Jinhui Tang;Guo-Jun Qi;Yan Song
Affiliations:
Microsoft Research Asia, Beijing, P. R. China;Microsoft Research Asia, Beijing, P. R. China;University of Science and Technology of China, Hefei, P. R. China;University of Science and Technology of China, Hefei, P. R. China;University of Science and Technology of China, Hefei, P. R. China;University of Science and Technology of China, Hefei, P. R. China
Venue:
IEEE Transactions on Circuits and Systems for Video Technology
Year:
2009

Abstract

Learning-based video annotation is a promising approach to facilitating video retrieval, and it avoids the intensive labor cost of purely manual annotation. However, it frequently encounters several difficulties, such as insufficient training data and the curse of dimensionality. In this paper, we propose a method named optimized multigraph-based semi-supervised learning (OMG-SSL), which aims to tackle these difficulties simultaneously in a unified scheme. We show that several crucial factors in video annotation, including multiple modalities, multiple distance functions, and temporal consistency, all correspond to different relationships among video units, and hence can be represented by different graphs. These factors can therefore be handled simultaneously by learning with multiple graphs, which is the proposed OMG-SSL approach. Unlike existing graph-based semi-supervised learning methods, which use only one graph, OMG-SSL integrates multiple graphs into a regularization framework in order to fully exploit their complementary information. We show that this scheme is equivalent to first fusing the multiple graphs and then conducting semi-supervised learning on the fused graph. Through an optimization approach, the method assigns suitable weights to the graphs. Furthermore, we show that the proposed method can be implemented through a computationally efficient iterative process. Extensive experiments on the TREC video retrieval evaluation (TRECVID) benchmark demonstrate the effectiveness and efficiency of the proposed approach.
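
The "fuse the graphs, then propagate labels on the fused graph" idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' OMG-SSL implementation: the graph weights are fixed by hand rather than optimized, and the Gaussian affinity, the symmetric normalization, the propagation rule, the trade-off parameter `mu`, and all function names are assumptions chosen for illustration, following a common graph-based semi-supervised learning formulation.

```python
import numpy as np


def normalized_affinity(X, sigma=1.0):
    """Gaussian-kernel graph over the rows of X, normalized as D^-1/2 W D^-1/2."""
    sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    W = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)  # no self-loops
    inv_sqrt_deg = 1.0 / np.sqrt(W.sum(axis=1))
    return W * inv_sqrt_deg[:, None] * inv_sqrt_deg[None, :]


def fuse_graphs(graphs, weights):
    """Convex combination of normalized graphs (weights are rescaled to sum to 1)."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return sum(wi * S for wi, S in zip(w, graphs))


def propagate(S, y, mu=0.9, iters=500):
    """Iterative label propagation: f <- mu * S f + (1 - mu) * y."""
    f = y.astype(float).copy()
    for _ in range(iters):
        f = mu * (S @ f) + (1.0 - mu) * y
    return f


def closed_form(S, y, mu=0.9):
    """Fixed point of the iteration above: (1 - mu) * (I - mu * S)^-1 y."""
    n = S.shape[0]
    return (1.0 - mu) * np.linalg.solve(np.eye(n) - mu * S, y)


# Hypothetical toy data: 8 video units described by two feature sets
# ("modalities"); units 0-3 and units 4-7 form two groups in both.
feat_a = np.array([[0.0], [0.1], [0.2], [0.3], [5.0], [5.1], [5.2], [5.3]])
feat_b = np.array([[0.0, 0.0], [0.2, 0.1], [0.1, 0.3], [0.3, 0.2],
                   [4.0, 4.0], [4.2, 4.1], [4.1, 4.3], [4.3, 4.2]])

# Only two units are labeled: unit 0 is positive, unit 4 is negative.
y = np.zeros(8)
y[0], y[4] = 1.0, -1.0

# One graph per modality, fused with hand-picked (not optimized) weights.
S = fuse_graphs([normalized_affinity(feat_a), normalized_affinity(feat_b)],
                weights=[0.5, 0.5])
scores = propagate(S, y)
print(np.sign(scores))
```

In this sketch the iterative update converges to the same scores as the closed-form solve, which is why an iterative process can stand in for a matrix inversion on larger graphs. Replacing the fixed `weights` with values learned from the data is the part of the method this example leaves out.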