Efficient Peer-to-Peer Similarity Query Processing for High-dimensional Data

  • Authors:
  • Ye Yuan;Guoren Wang;Yongjiao Sun

  • Affiliations:
  • -;-;-

  • Venue:
  • APWEB '10 Proceedings of the 2010 12th International Asia-Pacific Web Conference
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Objects, such as a digital image, a text document or a DNA sequence are usually represented in a high dimensional feature space. A fundamental issue in (peer-to-peer) P2P systems is to support an efficient similarity search for high-dimensional data in metric spaces. Prior works suffer from some fundamental limitations, such as being not adaptive to a highly dynamic network, poor search efficiency under skewed data scenarios, large maintenance overhead and etc. In this study, we propose an efficient scheme, Dragon, to support P2P similarity search in metric spaces. Dragon achieves the efficiency through the following designs: 1) Dragon is based on our previous designed P2P network, Phoenix, which has the optimal routing efficiency in dynamic scenarios. 2) We design a locality-preserving naming algorithm and a routing tree for each peer in Phoenix to support range queries. A radius-estimated method is proposed to transform a kNN query to a range query. 3) A load-balancing algorithm is given to support strong query processing under skewed data distributions. Extensive experiments verify the superiority of Dragon over existing works.