Contextual synonym dictionary for visual object retrieval

Authors:
Wenbin Tang; Rui Cai; Zhiwei Li; Lei Zhang
Affiliations:
Tsinghua University, Beijing, China; Microsoft Research Asia, Beijing, China; Microsoft Research Asia, Beijing, China; Microsoft Research Asia, Beijing, China
Venue:
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Year:
2011

Abstract

In this paper, we study the problem of visual object retrieval and introduce a dictionary of contextual synonyms to narrow the semantic gap in visual word quantization. The basic idea is to expand each visual word in the query image with its synonyms to boost retrieval recall. Unlike existing work such as soft quantization, which considers only the Euclidean (l2) distance in descriptor space, we select visual words that are more likely to describe visual objects with the same semantic meaning by identifying words with similar contextual distributions (i.e., contextual synonyms). We describe the contextual distribution of a visual word using statistics of both co-occurrence and spatial information, averaged over all image patches assigned to that visual word, and we propose an efficient system implementation for constructing the contextual synonym dictionary for a large visual vocabulary. The whole construction process is unsupervised, and the synonym dictionary can be naturally integrated into a standard bag-of-features image retrieval system. In experiments on several benchmark datasets, the contextual synonym dictionary-based expansion consistently outperforms l2 distance-based soft quantization and improves on state-of-the-art performance.
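
The idea in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes each image is a list of `(word_id, x, y)` features, reduces the spatial information to a simple radius cutoff, and compares contextual distributions by cosine similarity. The function names, the `radius` and `k` parameters, and the toy data are all hypothetical.

```python
from collections import defaultdict
from math import hypot, sqrt


def context_distributions(images, radius):
    """For each visual word, average the histogram of words seen within
    `radius` of its patches (assumed stand-in for the spatial statistics)."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(int)
    for feats in images:
        for i, (w, x, y) in enumerate(feats):
            counts[w] += 1
            for j, (v, u, t) in enumerate(feats):
                if i != j and hypot(x - u, y - t) <= radius:
                    sums[w][v] += 1.0
    # Divide by the number of patches carrying each word to get the average.
    return {w: {v: s / counts[w] for v, s in sums[w].items()} for w in counts}


def cosine(a, b):
    """Cosine similarity between two sparse histograms stored as dicts."""
    dot = sum(val * b.get(key, 0.0) for key, val in a.items())
    na = sqrt(sum(val * val for val in a.values()))
    nb = sqrt(sum(val * val for val in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def build_synonyms(dists, k):
    """Keep, for each word, the k other words with the most similar context."""
    synonyms = {}
    for w, dw in dists.items():
        scored = [(cosine(dw, dv), v) for v, dv in dists.items() if v != w]
        scored = [(s, v) for s, v in scored if s > 0.0]
        scored.sort(key=lambda sv: (-sv[0], sv[1]))
        synonyms[w] = [v for _, v in scored[:k]]
    return synonyms


def expand_query(words, synonyms):
    """Append each query word's synonyms, preserving order, without duplicates."""
    expanded = list(words)
    for w in words:
        for v in synonyms.get(w, []):
            if v not in expanded:
                expanded.append(v)
    return expanded


# Toy data: words 0 and 1 never co-occur but share the same neighbours (2, 3).
images = [
    [(0, 0, 0), (2, 1, 0), (3, 0, 1)],
    [(1, 0, 0), (2, 1, 0), (3, 0, 1)],
    [(4, 0, 0), (5, 1, 0)],
]
synonyms = build_synonyms(context_distributions(images, radius=2.0), k=1)
expanded = expand_query([0, 4], synonyms)
print(synonyms[0], expanded)
```

In the toy data, words 0 and 1 become synonyms because their surroundings match, even though nothing is said about their distance in descriptor space; this is the contrast with l2-based soft quantization that the abstract draws. The pairwise comparison here is quadratic in vocabulary size, so a large vocabulary would need the kind of efficient construction the abstract mentions.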