The Applicability of the Perturbation Model-based Privacy Preserving Data Mining for Real-world Data

  • Authors:
  • Li Liu; Bhavani Thuraisingham

  • Affiliations:
  • University of Texas at Dallas; University of Texas at Dallas

  • Venue:
  • ICDMW '06 Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
  • Year:
  • 2006

Abstract

Perturbation is an important technique in privacy preserving data mining. With this technique, there is always a trade-off between loss of information and preservation of privacy. The question is how much privacy users are willing to compromise, and the answer varies from individual to individual. In this paper, we propose an individually adaptable perturbation model that enables individuals to choose their own privacy level, so the model provides different privacy guarantees for different privacy preferences. We test the new perturbation model by applying different reconstruction methods to the perturbed data sets. Furthermore, we build decision tree and Naive Bayes classifier models on the reconstructed data sets, for both synthetic and real-world data sets. For the synthetic data set, our experimental results indicate that the model enables users to choose their own privacy level without reducing the accuracy of the data mining results. For the real-world data sets, we obtained very interesting results, which lead us to ask whether perturbation and reconstruction model-based privacy preserving data mining is applicable to real-world data.
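
To make the idea of individually chosen privacy levels concrete, here is a minimal sketch of one way such a perturbation could look. The abstract does not specify the noise distribution or how privacy levels are encoded, so everything in it is an assumption for illustration: additive zero-mean Gaussian noise, the three level names, the `NOISE_FRACTION` values, and the `perturb` helper are all hypothetical and are not taken from the paper.

```python
import random
import statistics

# Hypothetical privacy levels mapped to a noise standard deviation,
# expressed as a fraction of the attribute's value range (assumed values).
NOISE_FRACTION = {"low": 0.1, "medium": 0.5, "high": 1.0}


def perturb(values, levels, value_range, rng):
    """Add zero-mean Gaussian noise to each value, scaled by that individual's chosen level."""
    return [
        v + rng.gauss(0.0, NOISE_FRACTION[level] * value_range)
        for v, level in zip(values, levels)
    ]


rng = random.Random(0)

# Synthetic "age" attribute; each individual picks a privacy level independently.
ages = [rng.uniform(20, 80) for _ in range(10_000)]
levels = [rng.choice(["low", "medium", "high"]) for _ in ages]
perturbed = perturb(ages, levels, value_range=60.0, rng=rng)

# Zero-mean noise masks individual values while leaving the aggregate mean roughly intact.
print(round(statistics.mean(ages), 1), round(statistics.mean(perturbed), 1))
```

In this sketch, individuals who choose a higher level receive noisier values, and a data miner would work from the perturbed values (or a distribution reconstructed from them) and never from the originals.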