Privacy-preserving naive Bayes classification on distributed data via semi-trusted mixers

  • Authors:
  • Xun Yi;Yanchun Zhang

  • Affiliations:
  • School of Engineering and Science, Victoria University, PO Box 14428, Melbourne City MC, Victoria 8001, Australia;School of Engineering and Science, Victoria University, PO Box 14428, Melbourne City MC, Victoria 8001, Australia

  • Venue:
  • Information Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Distributed data mining applications, such as those dealing with health care, finance, counter-terrorism and homeland defense, use sensitive data from distributed databases held by different parties. This comes into direct conflict with an individual's need and right to privacy. It is thus of great importance to develop adequate security techniques for protecting privacy of individual values used for data mining. In this paper, we consider privacy-preserving naive Bayes classifier for horizontally partitioned distributed data and propose a two-party protocol and a multi-party protocol to achieve it. Our multi-party protocol is built on the semi-trusted mixer model, in which each data site sends messages to two semi-trusted mixers, respectively, which run our two-party protocol and then broadcast the classification result. This model facilitates both trust management and implementation. Security analysis has showed that our two-party protocol is a private protocol and our multi-party protocol is a private protocol as long as the two mixers do not conclude.