Hiding Sensitive Associative Classification Rule by Data Reduction

  • Authors:
  • Juggapong Natwichai;Maria E. Orlowska;Xingzhi Sun

  • Affiliations:
  • School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia;School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia

  • Venue:
  • ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

When data sharing becomes necessary, there is a dilemma in preserving privacy. On one hand sensitive patterns such as classification rules should be hidden from being discovered. On the other hand, hiding the sensitive patterns may affect the data quality. In this paper, we present our studies on the sensitive classification rule hiding problem by data reduction approach, i.e., removing the whole selected records. In our work, we focus on a particular type of classification rule, called canonical associative classification rule. And, the impact on data quality is evaluated in terms of the number of affected non-sensitive rules. We present the observations on the data quality based on a geometric model. According to the observations, we can show the impact precisely without any re-computing. This helps to improve the hiding algorithms from both effectiveness and efficiency perspective. Additionally, we present the algorithmic steps to demonstrate the removal of the records so that the impact on the data quality is potentially minimal. Finally, we conclude our work and outline future work directions for this problem.