Hiding classification rules for data sharing with privacy preservation

  • Authors:
  • Juggapong Natwichai;Xue Li;Maria Orlowska

  • Affiliations:
  • School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia

  • Venue:
  • DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we propose a method of hiding sensitive classification rules from data mining algorithms for categorical datasets. Our approach is to reconstruct a dataset according to the classification rules that have been checked and agreed by the data owner for releasing to data sharing. Unlike the other heuristic modification approaches, firstly, our method classifies a given dataset. Subsequently, a set of classification rules is shown to the data owner to identify the sensitive rules that should be hidden. After that we build a new decision tree that is constituted only non-sensitive rules. Finally, a new dataset is reconstructed. Our experiments show that the sensitive rules can be hidden completely on the reconstructed datasets. While non-sensitive rules are still able to discovered without any side effect. Moreover, our method can also preserve high usability of reconstructed datasets.