A classification based framework for privacy preserving data mining

  • Authors:
  • Animesh Tripathy;Jayanti Dansana;Ranjita Mishra

  • Affiliations:
  • KIIT University;KIIT University;KIIT University

  • Venue:
  • Proceedings of the International Conference on Advances in Computing, Communications and Informatics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The information age has enabled many organizations to gather huge volumes of data. A scenario in which two parties owning confidential databases wish to run a data mining algorithm on the union of their databases, without illuminating any unnecessary information requires the protection of the privileged information. The aim of a classification problem is to classify transactions into one of a discrete set of possible categories. The secure multiparty computation problems that need to be solved at this point of time are to find the class value with the most transactions and to determine whether all the transactions have the same class attribute. In this paper we demonstrate the difference between gini index and entropy attribute measures and prove that pruning results in accuracy and privacy.