Exploiting visual word co-occurrence for image retrieval

  • Authors:
  • Miaojing Shi;Xinghai Sun;Dacheng Tao;Chao Xu

  • Affiliations:
  • Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing, P.R.China, Beijing, China;Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing, P.R.China, Beijing, China;Centre for Quantum Computation and Intelligent Systems, University of Technology, Sydney, Australia, Sydney, Australia;Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing, P.R.China, Beijing, China

  • Venue:
  • Proceedings of the 20th ACM international conference on Multimedia
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bag-of-visual-words (BOVW) based image representation has received intense attention in recent years and has improved content based image retrieval (CBIR) significantly. BOVW does not consider the spatial correlation between visual words in natural images and thus biases the generated visual words towards noise when the corresponding visual features are not stable. In this paper, we construct a visual word co-occurrence table by exploring visual word co-occurrence extracted from small affine-invariant regions in a large collection of natural images. Based on this visual word co-occurrence table, we first present a novel high-order predictor to accelerate the generation of neighboring visual words. A co-occurrence matrix is introduced to refine the similarity measure for image ranking. Like the inverse document frequency (idf), it down-weights the contribution of the words that are less discriminative because of frequent co-occurrence. We conduct experiments on Oxford and Paris Building datasets, in which the ImageNet dataset is used to implement a large scale evaluation. Thorough experimental results suggest that our method outperforms the state-of-the-art, especially when the vocabulary size is comparatively small. In addition, our method is not much more costly than the BOVW model.