Normalized Cuts and Image Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Solution of the matrix equation AX + XB = C [F4]
Communications of the ACM
Region based image annotation through multiple-instance learning
Proceedings of the 13th annual ACM international conference on Multimedia
Label propagation through linear neighborhoods
ICML '06 Proceedings of the 23rd international conference on Machine learning
Semi-Supervised Classification Using Linear Neighborhood Propagation
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Image annotation refinement using random walk with restarts
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Correlative multi-label video annotation
Proceedings of the 15th international conference on Multimedia
Annotating Images by Mining Image Search Results
IEEE Transactions on Pattern Analysis and Machine Intelligence
Online multi-label active annotation: towards large-scale content-based video search
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Labelling Image Regions Using Wavelet Features and Spatial Prototypes
SAMT '08 Proceedings of the 3rd International Conference on Semantic and Digital Media Technologies: Semantic Multimedia
International Journal of Computer Vision
Automatic video tagging using content redundancy
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Label to region by bi-layer sparsity priors
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Distance metric learning from uncertain side information with application to automated photo tagging
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Ranking with local regression and global alignment for cross media retrieval
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Learning social tag relevance by neighbor voting
IEEE Transactions on Multimedia
The segmented and annotated IAPR TC-12 benchmark
Computer Vision and Image Understanding
Image clustering using local discriminant models and global integration
IEEE Transactions on Image Processing - Special section on distributed camera networks: sensing, processing, communication, and implementation
Multi-label boosting for image annotation by structural grouping sparsity
Proceedings of the international conference on Multimedia
Mining multi-tag association for image tagging
World Wide Web
Probabilistic spatial context models for scene content understanding
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Transfer tagging from image to video
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Tag localization with spatial correlations and joint group sparsity
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Video Annotation Based on Kernel Linear Neighborhood Propagation
IEEE Transactions on Multimedia
Multi-Layer Multi-Instance Learning for Video Concept Detection
IEEE Transactions on Multimedia
Video Annotation Through Search and Graph Reinforcement Mining
IEEE Transactions on Multimedia
Image Decomposition With Multilabel Context: Algorithms and Applications
IEEE Transactions on Image Processing
Editor's Choice Article: Sparse feature selection based on graph Laplacian for web image annotation
Image and Vision Computing
Hi-index | 0.01 |
In recent years, massive amounts of web image data have been emerging on the web. How to precisely label these images is critical and challenging to modern image search engines. Due to the fact that web image contents are more and more complex, existing image-level tagging methods may become less effective and hardly achieve satisfactory performance. This raises an urgent need for the fine-grained tagging, e.g., region-level tagging. In this work, we study how to establish mapping between tags and image regions. In particular, a novel hierarchical local image tagging method is proposed to simultaneously assign tags to all the regions within the same image. We propose a Laplacian Joint Group Lasso (LJGL) model to jointly reconstruct the regions within a test image with a set of labeled training data. The LJGL model not only considers the robust encoding ability of joint group lasso but also preserves local structural information embedded in test regions. Besides, we extend the LJGL model to a kernel version in order to achieve the non-linear reconstruction. An effective algorithm is devised to optimize the objective function of the proposed model. Tags of training data are propagated to the reconstructed regions according to the reconstruction coefficients. Extensive experiments on four public image datasets demonstrate that our proposed models achieve significant performance improvements over the state-of-the-art methods in local image tagging.