Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Scale & Affine Invariant Interest Point Detectors
International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Two motion-blurred images are better than one
Pattern Recognition Letters - Special issue: In memoriam Azriel Rosenfeld
A Performance Evaluation of Local Descriptors
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Comparison of Affine Region Detectors
International Journal of Computer Vision
Removing camera shake from a single photograph
ACM SIGGRAPH 2006 Papers
Scalable Recognition with a Vocabulary Tree
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions
Communications of the ACM - 50th anniversary issue: 1958 - 2008
Outdoors augmented reality on mobile phone using loxel-based visual feature organization
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Space-time tradeoffs for approximate nearest neighbor searching
Journal of the ACM (JACM)
Indexing local configurations of features for scalable content-based video copy detection
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Query expansion for hash-based image object retrieval
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Inferring semantic concepts from community-contributed images and noisy tags
MM '09 Proceedings of the 17th ACM international conference on Multimedia
ASIFT: A New Framework for Fully Affine Invariant Image Comparison
SIAM Journal on Imaging Sciences
Affine Stable Characteristic based sample expansion for object detection
Proceedings of the ACM International Conference on Image and Video Retrieval
Feature map hashing: sub-linear indexing of appearance and global geometry
Proceedings of the international conference on Multimedia
Proceedings of the international conference on Multimedia
SURF: speeded up robust features
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
City-scale landmark identification on mobile devices
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations
IEEE Transactions on Multimedia
Towards a Relevant and Diverse Search of Social Images
IEEE Transactions on Multimedia
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration
ACM Computing Surveys (CSUR)
Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Hi-index | 0.08 |
Mobile visual search is a new class of applications that use images taken by camera phone to initiate search queries. It is a very challenging task mainly because of image affine transformations caused by viewpoints changes, and motion blur due to hand tremble. These problems are unavoidable in mobile visual search and often result in low recall. Query expansion is an effective strategy for recall improvement, but existing methods are highly memory and time consuming, and often involve lots of redundant features. Integrating robust local patch mining and geometric parameter coding, this paper proposes an accurate offline query expansion method for large-scale mobile visual search. Concretely, a novel criterion is presented for robust patch evaluation and mining. Then multiple representative features are extracted from these selected local patches to deal with viewpoint changes. Moreover, the geometric parameter of each representative viewpoint is also recorded, to support fast and accurate feature matching. Experimental results on several well-known datasets and a large image set (1M) have demonstrated the effectiveness and efficiency of our method, especially its high robustness to viewpoint changes. The proposed approach can also be well generalized to other multimedia content analysis tasks.