The k closest pairs in spatial databases

  • Authors:
  • Gilberto Gutiérrez;Pablo Sáez

  • Affiliations:
  • Computer Science and Information Technologies Department, Universidad del Bío-Bío, Chillán, Chile;Computer Science and Information Technologies Department, Universidad del Bío-Bío, Chillán, Chile

  • Venue:
  • Geoinformatica
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We provide in this article a branch-and-bound algorithm that solves the problem of finding the k closest pairs of points (p,q), p驴驴驴P,q驴驴驴Q, considering two sets of points in the euclidean plane P,Q stored in external memory assuming that only one of the sets has a spatial index. This problem arises naturally in many scenarios, for instance when the set without an index is the answer to a spatial query. The main idea of our algorithm is to partition the space occupied by the set without an index into several cells or subspaces and to make use of the properties of a set of metrics defined on two Minimum Bounding Rectangles (MBRs). We evaluated our algorithm for different values of k by means of a series of experiments that considered both synthetical and real world datasets. We compared the performance of our algorithm with that of techniques that either assume that both datasets have a spatial index or that none has an index. The results show that our algorithm needs only between a 0.3 and a 35 % of the disk accesses required by such techniques. Our algorithm also shows a good scalability, both in terms of k and of the size of the data set.