Reducing the dimensionality of the SIFT descriptor and increasing its effectiveness and efficiency in image retrieval via bag-of-features

  • Authors:
  • Glauco Vitor Pedrosa;Solange Oliveira Rezende;Agma Juci Machado Traina

  • Affiliations:
  • Universidade de São Paulo, São Carlos, Brazil;Universidade de São Paulo, São Carlos, Brazil;Universidade de São Paulo, São Carlos, Brazil

  • Venue:
  • Proceedings of the 18th Brazilian symposium on Multimedia and the web
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Bag-of-Features is a popular approach to describe multimedia information by using visual words. The SIFT (Scale Invariant Feature Transform) is one of the most utilized descriptor to model multimedia information in Bag of-Features. The data is described as a set of keypoints and a feature vector is assigned for each of the keypoints. This feature vector is composed of 128 values, which represent the region around each keypoint. In general, some of the detected keypoints are not relevant and can be discarded without losing the local discriminative power. In this paper, we propose a technique to reduce the detected keypoints by SIFT, as well as a technique to reduce the feature vector dimensionality. Experiments were made in order to analyze the performance of the proposed reduction techniques using two different image databases. The results demonstrated that the proposed techniques improve the performance of the image retrieval by reducing up to 50% the feature vector dimensionality of SIFT and at the same time providing a gain of computational time of modeling an image employing Bag-of-Features.