Descriptor learning for efficient retrieval

  • Authors:
  • James Philbin; Michael Isard; Josef Sivic; Andrew Zisserman

  • Affiliations:
  • Department of Engineering Science, University of Oxford; Microsoft Research, Silicon Valley; INRIA, WILLOW, Laboratoire d'Informatique de l'École Normale Supérieure, Paris; Department of Engineering Science, University of Oxford

  • Venue:
  • ECCV'10: Proceedings of the 11th European Conference on Computer Vision, Part III
  • Year:
  • 2010


Abstract

Many visual search and matching systems represent images using sparse sets of "visual words": descriptors that have been quantized by assignment to the best-matching symbol in a discrete vocabulary. Errors in this quantization procedure propagate throughout the rest of the system, either harming performance or requiring correction using additional storage or processing. This paper aims to reduce these quantization errors at source, by learning a projection from descriptor space to a new Euclidean space in which standard clustering techniques are more likely to assign matching descriptors to the same cluster, and non-matching descriptors to different clusters. To achieve this, we learn a non-linear transformation model by minimizing a novel margin-based cost function, which aims to separate matching descriptors from two classes of non-matching descriptors. Training data is generated automatically by leveraging geometric consistency. Scalable stochastic gradient methods are used for the optimization. For the case of particular object retrieval, we demonstrate impressive gains in performance on a ground-truth dataset: our learnt 32-D descriptor without spatial re-ranking outperforms a baseline method using 128-D SIFT descriptors with spatial re-ranking.
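
To make the margin-based idea concrete, here is a minimal sketch, not the paper's actual model: the paper learns a non-linear transformation with a cost that separates matches from two classes of non-matching descriptors, whereas this simplified NumPy version learns a single linear projection (128-D to 32-D) under a single hinge margin, trained by plain stochastic gradient descent. All names and hyperparameters (margin, lr, epochs) are illustrative.

```python
# Minimal, illustrative sketch -- NOT the paper's model. The paper uses a
# non-linear transformation and two classes of non-matching descriptors;
# here we simplify to one linear projection W and one hinge margin, with SGD.
import numpy as np

def hinge_pair_grad(W, x1, x2, match, margin=1.0):
    """Loss and gradient for one descriptor pair under a hinge margin.

    Matching pairs are penalized when their projected squared distance
    exceeds the margin; non-matching pairs when it falls below it.
    """
    diff = x1 - x2
    d = W @ diff                        # pair difference in projected space
    dist2 = d @ d                       # squared Euclidean distance
    viol = (dist2 - margin) if match else (margin - dist2)
    if viol <= 0.0:                     # margin satisfied: no loss, no update
        return 0.0, np.zeros_like(W)
    sign = 1.0 if match else -1.0       # pull matches in, push non-matches out
    return viol, 2.0 * sign * np.outer(d, diff)

def train_projection(pairs, in_dim=128, out_dim=32, lr=1e-3, epochs=10, seed=0):
    """SGD over (x1, x2, is_match) triples; returns the learnt projection W."""
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=1.0 / np.sqrt(in_dim), size=(out_dim, in_dim))
    for _ in range(epochs):
        for i in rng.permutation(len(pairs)):
            x1, x2, match = pairs[i]
            _, grad = hinge_pair_grad(W, x1, x2, match)
            W -= lr * grad
    return W

# Toy usage with random "descriptors". Real training pairs would come from
# geometrically verified correspondences, as described in the abstract.
if __name__ == "__main__":
    rng = np.random.default_rng(1)
    base = rng.normal(size=(200, 128))
    pairs = [(x, x + 0.05 * rng.normal(size=128), True) for x in base]
    pairs += [(x, rng.normal(size=128), False) for x in base]
    W = train_projection(pairs)
    print("learnt projection shape:", W.shape)   # (32, 128)
```

A production version would mini-batch the updates and add the paper's second class of non-matching pairs; the toy block above only checks that the optimization runs end to end.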