High-dimensional kNN joins with incremental updates

  • Authors:
  • Cui Yu;Rui Zhang;Yaochun Huang;Hui Xiong

  • Affiliations:
  • Monmouth University, West Long Branch, USA 07764;University of Melbourne, Carlton, Australia 3053;University of Texas - Dallas, Dallas, USA 75080;Rutgers, the State University of New Jersey, Newark, USA 07102

  • Venue:
  • Geoinformatica
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The k Nearest Neighbor (kNN) join operation associates each data object in one data set with its k nearest neighbors from the same or a different data set. The kNN join on high-dimensional data (high-dimensional kNN join) is a very expensive operation. Existing high-dimensional kNN join algorithms were designed for static data sets and therefore cannot handle updates efficiently. In this article, we propose a novel kNN join method, named kNNJoin +, which supports efficient incremental computation of kNN join results with updates on high-dimensional data. As a by-product, our method also provides answers for the reverse kNN queries with very little overhead. We have performed an extensive experimental study. The results show the effectiveness of kNNJoin+ for processing high-dimensional kNN joins in dynamic workloads.