Replication, load balancing and efficient range query processing in DHTs

  • Authors:
  • Theoni Pitoura;Nikos Ntarmos;Peter Triantafillou

  • Affiliations:
  • Research Academic Computer Technology Institute, and Computer Engineering, and Informatics Department, University of Patras, Greece;Research Academic Computer Technology Institute, and Computer Engineering, and Informatics Department, University of Patras, Greece;Research Academic Computer Technology Institute, and Computer Engineering, and Informatics Department, University of Patras, Greece

  • Venue:
  • EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the conflicting problems of ensuring data-access load balancing and efficiently processing range queries on peer-to-peer data networks maintained over Distributed Hash Tables (DHTs). Placing consecutive data values in neighboring peers is frequently used in DHTs since it accelerates range query processing. However, such a placement is highly susceptible to load imbalances, which are preferably handled by replicating data (since replication also introduces fault tolerance benefits). In this paper, we present HotRoD, a DHT-based architecture that deals effectively with this combined problem through the use of a novel locality-preserving hash function, and a tunable data replication mechanism which allows trading off replication costs for fair load distribution. Our detailed experimentation study shows strong gains in both range query processing efficiency and data-access load balancing, with low replication overhead. To our knowledge, this is the first work that concurrently addresses the two conflicting problems using data replication.