Large scale disk-based metric indexing structure for approximate information retrieval by content

  • Authors:
  • Stanislav Barton;Valerie Gouet-Brunet;Marta Rukoz

  • Affiliations:
  • CNAM/CEDRIC, Paris, Cedex;CNAM/CEDRIC, Paris, Cedex;POND University, Nanterre, France

  • Venue:
  • Proceedings of the 1st Workshop on New Trends in Similarity Search
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In order to achieve large scalability, indexing structures are usually distributed to incorporate more of expensive main memory during the query processing. In this paper, an indexing structure, that does not suffer from a performance degradation by its transition from main memory storage to hard drive, is proposed. The high efficiency of the index is achieved using a very effective pruning based on precomputed distances and so called locality phenomenon which substantially diminishes the number of retrieved candidates. The trade-offs for the large scalability are, firstly, the approximation and, secondly, longer query times, yet both are still bearable enough for recent multimedia content-based search systems, proved by an evaluation using visual and audio data and both metric and semi-metric distance functions. The tuning of the index's parameters based on the analysis of the particular's data intrinsic dimensionality is also discussed.