Automatic image tagging as a random walk with priors on the canonical correlation subspace

Authors:
Timothée Bailloeul;Caizhi Zhu;Yinghui Xu
Affiliations:
Ricoh Software Research Center (Beijing) Co., Ltd., Beijing, China;Ricoh Software Research Center (Beijing) Co., Ltd., Beijing, China;Ricoh R&D Group, Kanagawa-Ken Yokohama-Shi, Japan
Venue:
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Year:
2008

Citing 9
Cited 8

Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Modeling annotated data

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
GCap: Graph-based Automatic Image Captioning

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 9 - Volume 09
An adaptive graph model for automatic image annotation

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Fast Random Walk with Restart and Its Applications

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Supervised Learning of Semantic Classes for Image Annotation and Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence
Real-Time Computerized Annotation of Pictures

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiple Bernoulli relevance models for image and video annotation

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
A correlation approach for automatic image annotation

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications

Automatic image semantic interpretation using social action and tagging data

Multimedia Tools and Applications
Mining multi-tag association for image tagging

World Wide Web
Tagging image by exploring weighted correlation between visual features and tags

WAIM'11 Proceedings of the 12th international conference on Web-age information management
Image tagging by exploiting feature correlation

ICADL'11 Proceedings of the 13th international conference on Asia-pacific digital libraries: for cultural heritage, knowledge dissemination, and future creation
SympGraph: a framework for mining clinical notes through symptom relation graphs

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Improving image tags by exploiting web search results

Multimedia Tools and Applications
A spatio-temporal pyramid matching for video retrieval

Computer Vision and Image Understanding
Social image tagging using graph-based reinforcement on multi-type interrelated objects

Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a graph-based scheme founded on the GCap method of Pan et al. [12] to perform automatic image annotation. Our approach, namely enhanced GCap (EGCap), takes advantage of the canonical correlation analysis technique (CCA) to shorten the semantic gap in the image space and define a new metric in the text space to correlate annotations. As a result, graph linkage errors at the image level are decreased and the consistency of tags output by the system is improved. Besides, we introduce graph link weighting techniques based on inverse document frequency and CCA metric which are proved to enhance the annotation quality. Simple and self-consistent, the present approach achieves image tagging in real time due to the lightweight Local Binary Pattern image features used, the absence of image segmentation, and the reduced size of feature vectors after CCA projection. We test the proposed approach against top-grade state-of-the-art techniques on Corel and Flickr databases, and show the effectiveness of our method in terms of per-word, per-image and processing time performance indicators.