Privacy preserving BIRCH algorithm for clustering over vertically partitioned databases

  • Authors:
  • P. Krishna Prasad;C. Pandu Rangan

  • Affiliations:
  • Department of Computer Science and Engineering, Indian Institute of Technology, Madras, Chennai, India;Department of Computer Science and Engineering, Indian Institute of Technology, Madras, Chennai, India

  • Venue:
  • SDM'06 Proceedings of the Third VLDB international conference on Secure Data Management
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

BIRCH algorithm, introduced by Zhang et al. [15], is a well known algorithm for effectively finding clusters in a large data set. The two major components of the BIRCH algorithm are CF tree construction and global clustering. However BIRCH algorithm is basically designed as an algorithm working on a single database. We propose the first novel method for running BIRCH over a vertically partitioned data sets, distributed in two different databases in a privacy preserving manner. We first provide efficient solutions to crypto primitives such as finding minimum index in a vector sum and checking if sum of two private values exceed certain threshold limit. We then use these primitives as basic tools to arrive at secure solutions to CF tree construction and single link clustering for implementing BIRCH algorithm.