Speeding Up Permutation Based Indexing with Indexing

Authors:
Karina Figueroa;Kimmo Frediksson
Affiliations:
-;-
Venue:
SISAP '09 Proceedings of the 2009 Second International Workshop on Similarity Search and Applications
Year:
2009

Citing 0
Cited 5

Symbolic regression using nearest neighbor indexing

Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
Fast k-NN classifier for documents based on a graph structure

CIARP'10 Proceedings of the 15th Iberoamerican congress conference on Progress in pattern recognition, image analysis, computer vision, and applications
Succinct nearest neighbor search

Proceedings of the Fourth International Conference on SImilarity Search and APplications
Versatile probability-based indexing for approximate similarity search

Proceedings of the Fourth International Conference on SImilarity Search and APplications
Parallel approaches to permutation-based indexing using inverted files

SISAP'12 Proceedings of the 5th international conference on Similarity Search and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

A recent probabilistic approach for searching in high dimensional metric spaces is based on predicting the distances between database elements according to how they order their distances towards some set of distinguished elements, called permutants. In the preprocessing phase a set of permutants is chosen, and are sorted (permuted) by their distances against every database element. The permutations form the index. When a query is given, its corresponding permutation is computed, and --- as similar elements will (probably) have a similar permutation --- the database is compared in the order induced by the similarity between permutations. This works well but has relatively high CPU time due to computing the distances between permutations and (partially) sorting the database by the similarity. We improve this by identifying and solving this as another metric space problem. This avoids many distance computations between the permutants. The experimental results show that this works extremely well in practice.