Processing multi-way spatial joins on map-reduce

  • Authors:
  • Himanshu Gupta;Bhupesh Chawda;Sumit Negi;Tanveer A. Faruquie;L. V. Subramaniam;Mukesh Mohania

  • Affiliations:
  • IBM India Research Laboratory, New Delhi, India;IBM India Research Laboratory, New Delhi, India;IBM India Research Laboratory, New Delhi, India;IBM India Research Laboratory, New Delhi, India;IBM India Research Laboratory, New Delhi, India;IBM India Research Laboratory, New Delhi, India

  • Venue:
  • Proceedings of the 16th International Conference on Extending Database Technology
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we investigate the problem of processing multi-way spatial joins on map-reduce platform. We look at two common spatial predicates - overlap and range. We address these two classes of join queries, discuss the challenges and outline novel approaches for executing these queries on a map-reduce framework. We then discuss how we can process join queries involving both overlap and range predicates. Specifically we present a Controlled-Replicate framework using which we design the approaches presented in this paper. The Controlled-Replicate framework is carefully engineered to minimize the communication among cluster nodes. Through experimental evaluations we discuss the complexity of the problem under investigation, details of Controlled-Replicate framework and demonstrate that the proposed approaches comfortably outperform naive approaches.