Visual tag dictionary: interpreting tags with visual words

Authors:
Meng Wang;Kuiyuan Yang;Xian-Sheng Hua;Hong-Jiang Zhang
Affiliations:
Microsoft Research Asia, Beijing, China;University of Science and Technology of China, Hefei, China;Microsoft Research Asia, Beijing, China;Microsoft Advanced Technology Center, Beijing, China
Venue:
WSMC '09 Proceedings of the 1st workshop on Web-scale multimedia corpus
Year:
2009

Citing 18
Cited 18

Stochastic Complexity in Statistical Inquiry Theory

Stochastic Complexity in Statistical Inquiry Theory
Cumulated gain-based evaluation of IR techniques

ACM Transactions on Information Systems (TOIS)
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
To search or to label?: predicting the performance of search-based automatic image classifiers

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
VirtualTour: an online travel assistant based on high quality images

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Discriminative training of GMM for speaker identification

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Flickr tag recommendation based on collective knowledge

Proceedings of the 17th international conference on World Wide Web
Web 2.0 dictionary

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Real-Time Computerized Annotation of Pictures

IEEE Transactions on Pattern Analysis and Machine Intelligence
Flickr distance

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Resolving tag ambiguity

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Learning tag relevance by neighbor voting for social image retrieval

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Visual diversification of image search results

Proceedings of the 18th international conference on World wide web
Tag ranking

Proceedings of the 18th international conference on World wide web
Inferring semantic concepts from community-contributed images and noisy tags

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Unified video annotation via multigraph learning

IEEE Transactions on Circuits and Systems for Video Technology
NUS-WIDE: a real-world web image database from National University of Singapore

Proceedings of the ACM International Conference on Image and Video Retrieval
Beyond distance measurement: constructing neighborhood similarity for video annotation

IEEE Transactions on Multimedia - Special section on communities and media computing

Event driven summarization for web videos

WSM '09 Proceedings of the first SIGMM workshop on Social media
Exploring large scale data for multimedia QA: an initial study

Proceedings of the ACM International Conference on Image and Video Retrieval
An integrated aurora image retrieval system: AuroraEye

Journal of Visual Communication and Image Representation
Representative views re-ranking for 3D model retrieval with multi-bipartite graph reinforcement model

Proceedings of the international conference on Multimedia
3D object retrieval with bag-of-region-words

Proceedings of the international conference on Multimedia
Surfing on artistic documents with visually assisted tagging

Proceedings of the international conference on Multimedia
Intelligent query: open another door to 3d object retrieval

Proceedings of the international conference on Multimedia
Automatic image semantic interpretation using social action and tagging data

Multimedia Tools and Applications
Social image annotation via cross-domain subspace learning

Multimedia Tools and Applications
Social multimedia: highlighting opportunities for search and mining of multimedia data in social media applications

Multimedia Tools and Applications
Social image search with diverse relevance ranking

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Mediapedia: mining web knowledge to construct multimedia encyclopedia

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Learning cooking techniques from youtube

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Video reference: a video question answering engine

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Optimizing social image search with multiple criteria: Relevance, diversity, and typicality

Neurocomputing
Constructing visual tag dictionary by mining community-contributed media corpus

Neurocomputing
Multimedia encyclopedia construction by mining web knowledge

Signal Processing
Picture tags and world knowledge: learning tag relations from visual semantic sources

Proceedings of the 21st ACM international conference on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Visual-word based image representation has shown effectiveness in a wide variety of applications such as categorization, annotation and search. By detecting keypoints in images and treating their patterns as visual words, an image can be represented as a bag of visual words, which is analogous to the bag-of-words representation of text documents. In this paper, we introduce a corpus named visual tag dictionary. Unlike the conventional dictionaries that define terms with textual words, the visual tag dictionary interprets each tag with visual words. The dictionary is constructed in a fully automatic way by exploring the tagged image data on the Internet. With this dictionary, tags and images are connected via visual words and many applications can be thus facilitated. As examples, we empirically demonstrate the effectiveness of the dictionary in tag-based image search, tag ranking and image annotation.