A log square average case algorithm to make insertions in fast similarity search

  • Authors:
  • Luisa Micó;Jose Oncina

  • Affiliations:
  • Dept. Lenguajes y Sistemas Informáticos, Universidad de Alicante, E-03071 Alicante, Spain;Dept. Lenguajes y Sistemas Informáticos, Universidad de Alicante, E-03071 Alicante, Spain

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2012

Quantified Score

Hi-index 0.10

Visualization

Abstract

To speed up similarity based searches many indexing techniques have been proposed in order to address the problem of efficiency. However, most of the proposed techniques do not admit fast insertion of new elements once the index is built. The main effect is that changes in the environment are very costly to be taken into account. In this work, we propose a new technique to allow fast insertions of elements in a family of static tree-based indexes. Unlike other techniques, the resulting index is exactly equal to the index that would be obtained by building it from scratch. Therefore there is no performance degradation in search time. We show that the expected number of distance computations (and the average time complexity) is bounded by a function that grows with log^2(n) where n is the size of the database. In order to check the correctness of our approach some experiments with artificial and real data are carried out.