Probability Distribution Reconstruction for Nominal Attributes in Privacy Preserving Classification

  • Authors:
  • Piotr Andruszkiewicz

  • Affiliations:
  • -

  • Venue:
  • ICHIT '08 Proceedings of the 2008 International Conference on Convergence and Hybrid Information Technology
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Concerns about privacy of data used in Data Mining have emerged recently. Users are afraid of misuse of this data and discovered knowledge. Thus several methods of preserving privacy classification have been proposed in literature. One of these methods enables miners to use continuous and nominal attributes simultaneously in classification. Reconstruction of probability distribution is an important task in privacy preserving classification for both nominal and continuous attributes which were distorted with the randomization-based technique and are stored in centralized database. We present the new algorithm - EQ - for reconstruction of probability distribution of nominal attributes, which outperforms former algorithm especially for high privacy levels. Effectiveness of the new solution (information loss in reconstruction of probability distribution of nominal attributes and accuracy of classification) has been tested and presented in this paper.