Label to region by bi-layer sparsity priors

Authors:
Xiaobai Liu;Bin Cheng;Shuicheng Yan;Jinhui Tang;Tat Seng Chua;Hai Jin
Affiliations:
National University of Singapore/ Huazhong University of Science and Technology, Singapore/ Wuhan, China, Singapore;National University of Singapore, Singapore;National University of Singapore, Singapore;National University of Singapore, Singapore;National University of Singapore, Singapore;Huazhong University of Science and Technology, Wuhan, China
Venue:
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Year:
2009

Citing 17
Cited 17

Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Indoor-Outdoor Image Classification

CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Locating Deciduous Trees

CAIVL '97 Proceedings of the 1997 Workshop on Content-Based Access of Image and Video Libraries (CBAIVL '97)
Body plans

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Efficient Graph-Based Image Segmentation

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Effective automatic image annotation via a coherent language model and active learning

Proceedings of the 12th annual ACM international conference on Multimedia
LOCUS: Learning Object Classes with Unsupervised Segmentation

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Correlated Label Propagation with Application to Multi-label Learning

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Working Set Selection Using Second Order Information for Training Support Vector Machines

The Journal of Machine Learning Research
ML-KNN: A lazy learning approach to multi-label learning

Pattern Recognition
Exploiting spatial context constraints for automatic image region annotation

Proceedings of the 15th international conference on Multimedia
Dual cross-media relevance model for image annotation

Proceedings of the 15th international conference on Multimedia
NUS-WIDE: a real-world web image database from National University of Singapore

Proceedings of the ACM International Conference on Image and Video Retrieval
Multiple Bernoulli relevance models for image and video annotation

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
TextonBoost: joint appearance, shape and context modeling for multi-class object recognition and segmentation

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I

Beyond tag relevance: integrating visual attention model and multi-instance learning for tag saliency ranking

Proceedings of the ACM International Conference on Image and Video Retrieval
Image search by concept map

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Unified tag analysis with multi-edge graph

Proceedings of the international conference on Multimedia
Image segmentation with patch-pair density priors

Proceedings of the international conference on Multimedia
Vicept: link visual features to concepts for large-scale image understanding

Proceedings of the international conference on Multimedia
Automatic image tagging via category label and web data

Proceedings of the international conference on Multimedia
Fuzzy based contextual cueing for region level annotation

ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
Content-based tag processing for Internet social images

Multimedia Tools and Applications
Mining multi-tag association for image tagging

World Wide Web
Video-to-shot tag allocation by weighted sparse group lasso

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Combining image-level and segment-level models for automatic annotation

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Combining visual attention model with multi-instance learning for tag ranking

Neurocomputing
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration

ACM Computing Surveys (CSUR)
Label-to-region with continuity-biased bi-layer sparsity priors

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Social tag alignment with image regions by sparse reconstructions

Proceedings of the 20th ACM international conference on Multimedia
Local image tagging via graph regularized joint group sparsity

Pattern Recognition
ObjectPatchNet: Towards scalable and semantic image annotation and retrieval

Computer Vision and Image Understanding

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this work, we investigate how to automatically reassign the manually annotated labels at the image-level to those contextually derived semantic regions. First, we propose a bi-layer sparse coding formulation for uncovering how an image or semantic region can be robustly reconstructed from the over-segmented image patches of an image set. We then harness it for the automatic label to region assignment of the entire image set. The solution to bi-layer sparse coding is achieved by convex l1-norm minimization. The underlying philosophy of bi-layer sparse coding is that an image or semantic region can be sparsely reconstructed via the atomic image patches belonging to the images with common labels, while the robustness in label propagation requires that these selected atomic patches come from very few images. Each layer of sparse coding produces the image label assignment to those selected atomic patches and merged candidate regions based on the shared image labels. The results from all bi-layer sparse codings over all candidate regions are then fused to obtain the entire label to region assignments. Besides, the presenting bi-layer sparse coding framework can be naturally applied to perform image annotation on new test images. Extensive experiments on three public image datasets clearly demonstrate the effectiveness of our proposed framework in both label to region assignment and image annotation tasks.