Counting Distance Permutations

  • Authors: Matthew Skala
  • Venue: SISAP '08: Proceedings of the First International Workshop on Similarity Search and Applications
  • Year: 2008

Abstract

A distance permutation index supports fast proximity searching in a high-dimensional metric space. Given some fixed reference sites, for each point in a database the index stores a permutation naming the closest site, the second-closest, and so on. We examine how many distinct permutations can occur as a function of the number of sites and the size of the space. We give theoretical results for tree metrics and vector spaces with $L_1$, $L_2$, and $L_\infty$ metrics, improving on the previous best known storage space in the vector case. We also give experimental results and commentary on the number of distance permutations that actually occur in a variety of vector, string, and document spaces.
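
To make the notion of a distance permutation concrete, the following is a minimal Python sketch, not the paper's implementation. It computes the permutation of site indices for a point by sorting sites from closest to farthest, then counts how many distinct permutations occur over a set of points. The function names (`distance_permutation`, `count_distinct_permutations`) and the rule of breaking distance ties by lower site index are assumptions made for illustration.

```python
from typing import Callable, Iterable, Sequence, Tuple

Point = Tuple[float, ...]
Metric = Callable[[Point, Point], float]


def l1(a: Point, b: Point) -> float:
    """L_1 (Manhattan) distance."""
    return sum(abs(x - y) for x, y in zip(a, b))


def l2(a: Point, b: Point) -> float:
    """L_2 (Euclidean) distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def linf(a: Point, b: Point) -> float:
    """L_infinity (Chebyshev) distance."""
    return max(abs(x - y) for x, y in zip(a, b))


def distance_permutation(
    point: Point, sites: Sequence[Point], metric: Metric
) -> Tuple[int, ...]:
    """Return site indices ordered from closest to farthest from `point`.

    Assumption: ties in distance are broken by the lower site index.
    """
    return tuple(
        sorted(range(len(sites)), key=lambda i: (metric(point, sites[i]), i))
    )


def count_distinct_permutations(
    points: Iterable[Point], sites: Sequence[Point], metric: Metric
) -> int:
    """Count the distinct distance permutations that occur among `points`."""
    return len({distance_permutation(p, sites, metric) for p in points})


# Illustrative data (hypothetical): three sites on a line, points sampled
# every 0.5 units from -5 to 15.
sites = [(0.0,), (4.0,), (10.0,)]
points = [(x / 2.0,) for x in range(-10, 31)]

print(distance_permutation((1.0,), sites, l2))
print(count_distinct_permutations(points, sites, l2))
```

In this one-dimensional example the three pairwise midpoints between sites (at 2, 5, and 7) cut the line into four intervals, so only four of the 3! = 6 possible permutations occur. This is the kind of gap between possible and occurring permutations that the abstract's counting question is about.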