A metric cache for similarity search

  • Authors:
  • Fabrizio Falchi;Claudio Lucchese;Salvatore Orlando;Raffaele Perego;Fausto Rabitti

  • Affiliations:
  • ISTI-CNR, Pisa, Italy;ISTI-CNR, Pisa, Italy;Università Ca' Foscari, Venezia, Italy;ISTI-CNR, Pisa, Italy;ISTI-CNR, Pisa, Italy

  • Venue:
  • Proceedings of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Similarity search in metric spaces is a general paradigm that can be used in several application fields. It can also be effectively exploited in content-based image retrieval systems, which are shifting their target towards the Web-scale dimension. In this context, an important issue becomes the design of scalable solutions, which combine parallel and distributed architectures with caching at several levels. To this end, we investigate the design of a similarity cache that works in metric spaces. It is able to answer with exact and approximate results: even when an exact match is not present in cache, our cache may return an approximate result set with quality guarantees. By conducting tests on a collection of one million high-quality digital photos, we show that the proposed caching techniques can have a significant impact on performance, like caching on text queries has been proved effective for traditional Web search engines.