All-nearest-neighbors finding based on the Hilbert curve

  • Authors:
  • Hue-Ling Chen;Ye-In Chang

  • Affiliations:
  • Dept. of Computer Science and Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan, ROC;Dept. of Computer Science and Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan, ROC

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 12.06

Visualization

Abstract

An all-nearest-neighbors (ANN) query retrieves all nearest neighbors to all query objects. We may perform large number of one-nearest-neighbor queries to answer such an ANN query. Due to no total ordering of spatial proximity among spatial objects, the Hilbert curve approach has proposed to preserve the spatial locality. Chen and Chang have proposed a neighbor finding strategy (denoted as the CCSF strategy) based on the Hilbert curve to compute the absolute location of the neighboring blocks. However, it costs much time during the transformation between the Hilbert curve and the Peano curve. On the other hand, in the strategy based on R or R^*-trees for an ANN query, large number of unnecessary distance comparisons have to be done due to the problem of overlaps within the R-tree, resulting in many redundant disk accesses. Therefore, in this paper, we first propose the one-nearest-neighbor finding strategy directly based on the Hilbert curve (denoted as the ONHC strategy) for a one-nearest-neighbor query. By relations among orientations, orders, and quaternary numbers, we compute the relative locations of the query block and the neighboring block in the Hilbert curve. Then, the nearest neighbor of one query point can be found directly from these neighboring blocks. Next, by using our ONHC strategy, we propose the all-nearest-neighbors finding strategy based on the Hilbert curve (denoted as the ANHC strategy) for an ANN query. Finally, from the simulation result, we show that our ONHC strategy needs less response time (the CPU-time and the I/O time) than the CCSF strategy for the one-nearest-neighbor query. We also show that our ANHC strategy needs less response time than the strategy based on R^*-trees for an ANN query.