Local image tagging via graph regularized joint group sparsity

Authors:
Yang Yang;Zi Huang;Yi Yang;Jiajun Liu;Heng Tao Shen;Jiebo Luo
Affiliations:
School of Information Technology and Electrical Engineering, The University of Queensland, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, Australia;School of Computer Science, Carnegie Mellon University, USA;School of Information Technology and Electrical Engineering, The University of Queensland, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, Australia;Department of Computer Science, University of Rochester, USA
Venue:
Pattern Recognition
Year:
2013

Citing 29
Cited 1

Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Solution of the matrix equation AX + XB = C [F4]

Communications of the ACM
Region based image annotation through multiple-instance learning

Proceedings of the 13th annual ACM international conference on Multimedia
Label propagation through linear neighborhoods

ICML '06 Proceedings of the 23rd international conference on Machine learning
Semi-Supervised Classification Using Linear Neighborhood Propagation

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Region-based Image Annotation using Asymmetrical Support Vector Machine-based Multiple-Instance Learning

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Image annotation refinement using random walk with restarts

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Correlative multi-label video annotation

Proceedings of the 15th international conference on Multimedia
Annotating Images by Mining Image Search Results

IEEE Transactions on Pattern Analysis and Machine Intelligence
Online multi-label active annotation: towards large-scale content-based video search

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Labelling Image Regions Using Wavelet Features and Spatial Prototypes

SAMT '08 Proceedings of the 3rd International Conference on Semantic and Digital Media Technologies: Semantic Multimedia
TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context

International Journal of Computer Vision
Automatic video tagging using content redundancy

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Label to region by bi-layer sparsity priors

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Distance metric learning from uncertain side information with application to automated photo tagging

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Ranking with local regression and global alignment for cross media retrieval

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Learning social tag relevance by neighbor voting

IEEE Transactions on Multimedia
The segmented and annotated IAPR TC-12 benchmark

Computer Vision and Image Understanding
Image clustering using local discriminant models and global integration

IEEE Transactions on Image Processing - Special section on distributed camera networks: sensing, processing, communication, and implementation
Multi-label boosting for image annotation by structural grouping sparsity

Proceedings of the international conference on Multimedia
Mining multi-tag association for image tagging

World Wide Web
Probabilistic spatial context models for scene content understanding

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Transfer tagging from image to video

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Automatic tagging by exploring tag information capability and correlation

World Wide Web
Tag localization with spatial correlations and joint group sparsity

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Video Annotation Based on Kernel Linear Neighborhood Propagation

IEEE Transactions on Multimedia
Multi-Layer Multi-Instance Learning for Video Concept Detection

IEEE Transactions on Multimedia
Video Annotation Through Search and Graph Reinforcement Mining

IEEE Transactions on Multimedia
Image Decomposition With Multilabel Context: Algorithms and Applications

IEEE Transactions on Image Processing

Editor's Choice Article: Sparse feature selection based on graph Laplacian for web image annotation

Image and Vision Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

In recent years, massive amounts of web image data have been emerging on the web. How to precisely label these images is critical and challenging to modern image search engines. Due to the fact that web image contents are more and more complex, existing image-level tagging methods may become less effective and hardly achieve satisfactory performance. This raises an urgent need for the fine-grained tagging, e.g., region-level tagging. In this work, we study how to establish mapping between tags and image regions. In particular, a novel hierarchical local image tagging method is proposed to simultaneously assign tags to all the regions within the same image. We propose a Laplacian Joint Group Lasso (LJGL) model to jointly reconstruct the regions within a test image with a set of labeled training data. The LJGL model not only considers the robust encoding ability of joint group lasso but also preserves local structural information embedded in test regions. Besides, we extend the LJGL model to a kernel version in order to achieve the non-linear reconstruction. An effective algorithm is devised to optimize the objective function of the proposed model. Tags of training data are propagated to the reconstructed regions according to the reconstruction coefficients. Extensive experiments on four public image datasets demonstrate that our proposed models achieve significant performance improvements over the state-of-the-art methods in local image tagging.