Revisiting the VLAD image representation

  • Authors:
  • Jonathan Delhumeau;Philippe-Henri Gosselin;Hervé Jégou;Patrick Pérez

  • Affiliations:
  • INRIA, Rennes, France;INRIA, Rennes, France;INRIA, Rennes, France;Technicolor, Rennes, France

  • Venue:
  • Proceedings of the 21st ACM international conference on Multimedia
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recent works on image retrieval have proposed to index images by compact representations encoding powerful local descriptors, such as the closely related VLAD and Fisher vector. By combining such a representation with a suitable coding technique, it is possible to encode an image in a few dozen bytes while achieving excellent retrieval results. This paper revisits some assumptions proposed in this context regarding the handling of "visual burstiness", and shows that ad-hoc choices are implicitly done which are not desirable. Focusing on VLAD without loss of generality, we propose to modify several steps of the original design. Albeit simple, these modifications significantly improve VLAD and make it compare favorably against the state of the art.