Cost models for nearest neighbor query processing over existentially uncertain spatial data

  • Authors:
  • Elias Frentzos;Nikos Pelekis;Nikos Giatrakos;Yannis Theodoridis

  • Affiliations:
  • Department of Informatics, University of Piraeus, Piraeus, Greece;Department of Statistics & Insurance Science, University of Piraeus, Piraeus, Greece;Dept. of Electronics & Computer Engineering, Technical University of Crete, Crete, Greece;Department of Informatics, University of Piraeus, Piraeus, Greece

  • Venue:
  • SSTD'13 Proceedings of the 13th international conference on Advances in Spatial and Temporal Databases
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A major challenge posed by real-world applications involving spatial information deals with the uncertainty inherent in the data. One type of uncertainty in spatial objects may come from their existence, which is expressed by a probability accompanying the spatial value of an object reflecting the confidence of the object's existence. A challenging query type over existentially uncertain data is the search of the Nearest Neighbour (NN), as the likelihood of an object to be the NN of the query object does not only depend on its distances from other objects, but also from their existence. In this paper, we present exact and approximate statistical methodologies for supporting cost models for Probabilistic Thresholding NN (PTNN) queries that deal with arbitrarily distributed data points and existential uncertainty, with the aid of appropriate novel histograms, sampling and statistical approximations. Our cost model can be also modified in order to support Probabilistic Ranking NN (PRNN) queries with the aid of sampling. The accuracy of our approaches is exhibited through extensive experimentation on synthetic and real datasets.