Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Indoor-Outdoor Image Classification
CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
CAIVL '97 Proceedings of the 1997 Workshop on Content-Based Access of Image and Video Libraries (CBAIVL '97)
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Automatic image annotation and retrieval using cross-media relevance models
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Efficient Graph-Based Image Segmentation
International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Effective automatic image annotation via a coherent language model and active learning
Proceedings of the 12th annual ACM international conference on Multimedia
LOCUS: Learning Object Classes with Unsupervised Segmentation
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Correlated Label Propagation with Application to Multi-label Learning
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Working Set Selection Using Second Order Information for Training Support Vector Machines
The Journal of Machine Learning Research
ML-KNN: A lazy learning approach to multi-label learning
Pattern Recognition
Exploiting spatial context constraints for automatic image region annotation
Proceedings of the 15th international conference on Multimedia
Dual cross-media relevance model for image annotation
Proceedings of the 15th international conference on Multimedia
NUS-WIDE: a real-world web image database from National University of Singapore
Proceedings of the ACM International Conference on Image and Video Retrieval
Multiple Bernoulli relevance models for image and video annotation
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
Proceedings of the ACM International Conference on Image and Video Retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Unified tag analysis with multi-edge graph
Proceedings of the international conference on Multimedia
Image segmentation with patch-pair density priors
Proceedings of the international conference on Multimedia
Vicept: link visual features to concepts for large-scale image understanding
Proceedings of the international conference on Multimedia
Automatic image tagging via category label and web data
Proceedings of the international conference on Multimedia
Fuzzy based contextual cueing for region level annotation
ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
Content-based tag processing for Internet social images
Multimedia Tools and Applications
Mining multi-tag association for image tagging
World Wide Web
Video-to-shot tag allocation by weighted sparse group lasso
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Combining image-level and segment-level models for automatic annotation
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration
ACM Computing Surveys (CSUR)
Label-to-region with continuity-biased bi-layer sparsity priors
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Social tag alignment with image regions by sparse reconstructions
Proceedings of the 20th ACM international conference on Multimedia
Local image tagging via graph regularized joint group sparsity
Pattern Recognition
ObjectPatchNet: Towards scalable and semantic image annotation and retrieval
Computer Vision and Image Understanding
Hi-index | 0.00 |
In this work, we investigate how to automatically reassign the manually annotated labels at the image-level to those contextually derived semantic regions. First, we propose a bi-layer sparse coding formulation for uncovering how an image or semantic region can be robustly reconstructed from the over-segmented image patches of an image set. We then harness it for the automatic label to region assignment of the entire image set. The solution to bi-layer sparse coding is achieved by convex l1-norm minimization. The underlying philosophy of bi-layer sparse coding is that an image or semantic region can be sparsely reconstructed via the atomic image patches belonging to the images with common labels, while the robustness in label propagation requires that these selected atomic patches come from very few images. Each layer of sparse coding produces the image label assignment to those selected atomic patches and merged candidate regions based on the shared image labels. The results from all bi-layer sparse codings over all candidate regions are then fused to obtain the entire label to region assignments. Besides, the presenting bi-layer sparse coding framework can be naturally applied to perform image annotation on new test images. Extensive experiments on three public image datasets clearly demonstrate the effectiveness of our proposed framework in both label to region assignment and image annotation tasks.