Inferring semantic concepts from community-contributed images and noisy tags

Authors:
Jinhui Tang;Shuicheng Yan;Richang Hong;Guo-Jun Qi;Tat-Seng Chua
Affiliations:
National University of Singapore, Singapore;National University of Singapore, Singapore;National University of Singapore, Singapore;University of Illinois at Urbana-Champaign, Illinois, USA;National University of Singapore, Singapore
Venue:
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Year:
2009

Abstract

In this paper, we explore the problem of inferring the semantic concepts of images from community-contributed images and their associated noisy tags. To infer the concepts more accurately, we propose a novel sparse graph-based semi-supervised learning approach that harnesses labeled and unlabeled data simultaneously. The sparse graph, constructed by datum-wise one-vs-all sparse reconstructions of all samples, removes most of the concept-unrelated links among the data and is thus more robust and discriminative than conventional graphs. More importantly, we propose an effective training label refinement strategy within this graph-based learning framework to handle the noise in the tags by introducing a dual regularization on both the quantity and the sparsity of the noise. In addition, we construct an informative, compact concept space with a small semantic gap and infer the semantic concepts in this space, which helps bridge the semantic gap. The relations among different concepts are inherently embedded in this space and aid concept inference. We conduct extensive experiments on a real-world community-contributed image database consisting of 55,615 Flickr images and their associated tags. The results demonstrate the effectiveness of the proposed approaches and the capability of our method to deal with the noise in the tags. We further show that inferring semantic concepts from training data with noisy tags can achieve performance comparable to that obtained from training data with clean ground-truth labels.
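
As a rough illustration of the graph-based idea in the abstract, the sketch below builds a sparse graph by reconstructing each sample from all the others and then spreads a few known labels over that graph. It is not the authors' formulation: the nonnegative L1-penalized reconstruction solved by projected gradient steps, the simple iterative propagation rule, the function names `sparse_graph` and `propagate_labels`, the parameters `lam` and `alpha`, and the toy data are all assumptions made for illustration. The training label refinement and the concept space described in the abstract are not modeled here.

```python
import numpy as np


def sparse_graph(X, lam=0.05, n_iter=200):
    """Row i holds nonnegative sparse weights that reconstruct X[i] from all other rows.

    Assumed stand-in objective: 0.5 * ||x_i - D w||^2 + lam * ||w||_1 with w >= 0.
    """
    n = X.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        others = np.delete(np.arange(n), i)
        D = X[others].T  # dictionary whose columns are the other samples
        step = 1.0 / (np.linalg.norm(D, 2) ** 2 + 1e-12)  # 1 / Lipschitz constant
        w = np.zeros(n - 1)
        for _ in range(n_iter):
            grad = D.T @ (D @ w - X[i])
            # Gradient step followed by the proximal step for the
            # nonnegative L1 penalty (shift by lam, then clip at zero).
            w = np.maximum(w - step * (grad + lam), 0.0)
        W[i, others] = w
    return W


def propagate_labels(W, Y, alpha=0.9, n_iter=100):
    """Iterate F <- alpha * P @ F + (1 - alpha) * Y, with P the row-normalized W."""
    row_sums = W.sum(axis=1, keepdims=True)
    P = np.divide(W, row_sums, out=np.zeros_like(W), where=row_sums > 0)
    F = Y.astype(float).copy()
    for _ in range(n_iter):
        F = alpha * (P @ F) + (1.0 - alpha) * Y
    return F


# Toy data (hypothetical): two tight clusters in 10 dimensions, one label each.
rng = np.random.default_rng(0)
centers = np.eye(10)[:2]
X = np.vstack([c + 0.01 * rng.standard_normal((10, 10)) for c in centers])
X /= np.linalg.norm(X, axis=1, keepdims=True)

Y = np.zeros((20, 2))
Y[0, 0] = 1.0   # sample 0 is labeled with concept 0
Y[10, 1] = 1.0  # sample 10 is labeled with concept 1

W = sparse_graph(X)
F = propagate_labels(W, Y)
pred = F.argmax(axis=1)  # inferred concept index for every sample
```

In this sketch each row of `W` is learned independently, so the graph is directed; a practical system might symmetrize it or swap the inner loop for a dedicated L1 solver.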