Automatic image tagging as a random walk with priors on the canonical correlation subspace

  • Authors:
  • Timothée Bailloeul;Caizhi Zhu;Yinghui Xu

  • Affiliations:
  • Ricoh Software Research Center (Beijing) Co., Ltd., Beijing, China;Ricoh Software Research Center (Beijing) Co., Ltd., Beijing, China;Ricoh R&D Group, Kanagawa-Ken Yokohama-Shi, Japan

  • Venue:
  • MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a graph-based scheme founded on the GCap method of Pan et al. [12] to perform automatic image annotation. Our approach, namely enhanced GCap (EGCap), takes advantage of the canonical correlation analysis technique (CCA) to shorten the semantic gap in the image space and define a new metric in the text space to correlate annotations. As a result, graph linkage errors at the image level are decreased and the consistency of tags output by the system is improved. Besides, we introduce graph link weighting techniques based on inverse document frequency and CCA metric which are proved to enhance the annotation quality. Simple and self-consistent, the present approach achieves image tagging in real time due to the lightweight Local Binary Pattern image features used, the absence of image segmentation, and the reduced size of feature vectors after CCA projection. We test the proposed approach against top-grade state-of-the-art techniques on Corel and Flickr databases, and show the effectiveness of our method in terms of per-word, per-image and processing time performance indicators.