Accurate off-line query expansion for large-scale mobile visual search

  • Authors:
  • Ke Gao, Yongdong Zhang, Dongming Zhang, Shouxun Lin

  • Affiliations:
  • Advanced Computing Research Laboratory, Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China (all authors)

  • Venue:
  • Signal Processing
  • Year:
  • 2013


Abstract

Mobile visual search is a new class of applications that use images taken by camera phones to initiate search queries. It is a challenging task, mainly because of the affine image transformations caused by viewpoint changes and the motion blur caused by hand tremor. These problems are unavoidable in mobile visual search and often result in low recall. Query expansion is an effective strategy for improving recall, but existing methods are highly memory- and time-consuming and often introduce many redundant features. By integrating robust local patch mining and geometric parameter coding, this paper proposes an accurate offline query expansion method for large-scale mobile visual search. Concretely, a novel criterion is presented for evaluating and mining robust patches. Multiple representative features are then extracted from these selected local patches to cope with viewpoint changes. Moreover, the geometric parameters of each representative viewpoint are recorded to support fast and accurate feature matching. Experimental results on several well-known datasets and a large image set (1M images) demonstrate the effectiveness and efficiency of our method, in particular its high robustness to viewpoint changes. The proposed approach also generalizes well to other multimedia content analysis tasks.
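
The offline expansion idea described in the abstract (simulating representative viewpoints of each database image, extracting features from robust local patches, and storing the geometric parameters of each view for later matching) can be illustrated with a short sketch. The Python code below is only an illustration under stated assumptions, not the authors' implementation: it uses OpenCV SIFT, hand-picked tilt and rotation parameters, and a simple keypoint-response threshold as a hypothetical stand-in for the paper's robust-patch mining criterion.

```python
# Illustrative sketch (not the paper's code): offline expansion of one database
# image with a few simulated viewpoints, keeping the affine parameters of each
# simulated view so they can later support geometric verification at query time.
import cv2
import numpy as np

def simulate_view(image, tilt, angle):
    """Rotate the image, then shrink it along x to mimic an out-of-plane tilt.
    Returns the warped image and the composed 2x3 affine matrix."""
    h, w = image.shape[:2]
    rot = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
    rotated = cv2.warpAffine(image, rot, (w, h))
    tilt_mat = np.array([[1.0 / tilt, 0.0, 0.0],
                         [0.0, 1.0, 0.0]], dtype=np.float32)
    tilted = cv2.warpAffine(rotated, tilt_mat, (int(w / tilt), h))
    # Compose the two affine transforms (promote rotation to 3x3, keep 2x3 result).
    full = tilt_mat @ np.vstack([rot, [0.0, 0.0, 1.0]])
    return tilted, full

def expand_image(image, tilts=(1.0, 2.0), angles=(0.0, 30.0), min_response=0.02):
    """Extract SIFT features from several simulated viewpoints of one image;
    each stored entry keeps the descriptor and the affine parameters of the
    view it came from."""
    sift = cv2.SIFT_create()
    entries = []
    for tilt in tilts:
        for angle in angles:
            warped, affine = simulate_view(image, tilt, angle)
            keypoints, descriptors = sift.detectAndCompute(warped, None)
            if descriptors is None:
                continue
            for kp, desc in zip(keypoints, descriptors):
                # Crude stand-in for robust-patch mining: keep strong keypoints only.
                if kp.response >= min_response:
                    entries.append({"descriptor": desc,
                                    "view_affine": affine,
                                    "position": kp.pt})
    return entries
```

In this toy version, the expanded index would store `entries` for every database image offline, so that a blurred or tilted query photo can match features from whichever simulated view is closest; the stored `view_affine` parameters allow the match geometry to be checked without recomputing warps at query time.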