Similarity search in metric databases through hashing

  • Authors:
  • Claudio Gennaro;Pasquale Savino;Pavel Zezula

  • Affiliations:
  • IEI-CNR, Pisa, Italy;IEI-CNR, Pisa, Italy;Masaryk Univ., Brno, Czech Republic

  • Venue:
  • MULTIMEDIA '01 Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrieval
  • Year:
  • 2001

Quantified Score

Hi-index 0.06

Visualization

Abstract

A novel access structure for similarity search in metric databases, called Similarity Hashing (SH), is proposed. It is a multi-level hash structure, consisting of search-separable bucket sets on each level. The structure supports easy insertion and bounded search costs, because at most one bucket needs to be accessed at each level for range queries up to a pre-defined value of search radius. At the same time, the pivot-based strategy significantly reduces the number of distance computations. Contrary to tree organizations, the SH structure is suitable for distributed and parallel implementations.