Joining privately on outsourced data

  • Authors:
  • Bogdan Carbunar;Radu Sion

  • Affiliations:
  • Applied Research Center, Motorola Labs, Schaumburg, IL;Computer Science, Stony Brook University, Stony Brook, NY

  • Venue:
  • SDM'10 Proceedings of the 7th VLDB conference on Secure data management
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In an outsourced database framework, clients place data management with specialized service providers. Of essential concern in such frameworks is data privacy. Potential clients are reluctant to outsource sensitive data to a foreign party without strong privacy assurances beyond policy "fine-prints". In this paper we introduce a mechanismfor executing general binary JOIN operations (for predicates that satisfy certain properties) in an outsourced relational database framework with full computational privacy and low overheads - a first, to the best of our knowledge. We illustrate via a set of relevant instances of JOIN predicates, including: range and equality (e.g., for geographical data), Hamming distance (e.g., for DNA matching) and semantics (i.e., in health-care scenarios - mapping antibiotics to bacteria). We experimentally evaluate the main overhead components and show they are reasonable. For example, the initial client computation overhead for 100000 data items is around 5 minutes. Moreover, our privacy mechanisms can sustain theoretical throughputs of over 30 million predicate evaluations per second, even for an unoptimized OpenSSL based implementation.