Probabilistic proximity searching algorithms based on compact partitions

  • Authors:
  • Benjamin Bustos; Gonzalo Navarro

  • Affiliations:
  • Department of Computer and Information Science, University of Konstanz, Universitaetstr. 10, 78457 Konstanz, Germany; Center for Web Research, Department of Computer Science, University of Chile, Blanco Encalada 2120, Santiago, Chile

  • Venue:
  • Journal of Discrete Algorithms - SPIRE 2002
  • Year:
  • 2004

Abstract

The main bottleneck of the research in metric space searching is the so-called curse of dimensionality, which makes the task of searching some metric spaces intrinsically difficult, whatever algorithm is used. A recent trend to break this bottleneck resorts to probabilistic algorithms, where it has been shown that one can find 99% of the relevant objects at a fraction of the cost of the exact algorithm. These algorithms are welcome in most applications because resorting to metric space searching already involves a fuzziness in the retrieval requirements. In this paper, we push further in this direction by developing probabilistic algorithms on data structures whose exact versions are the best for high dimensions. As a result, we obtain probabilistic algorithms that are better than the previous ones. We give new insights on the problem and propose a novel view based on time-bounded searching. We also propose an experimental framework for probabilistic algorithms that permits comparing them in offline mode.