An efficient sanitization algorithm for balancing information privacy and knowledge discovery in association patterns mining

  • Authors:
  • En Tzu Wang;Guanling Lee

  • Affiliations:
  • Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, ROC;Department of Computer Science and Information Engineering, National Dong Hwa University, Hualien, Taiwan, ROC

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Discovering frequent patterns in large databases is one of the most studied problems in data mining, since it can yield substantial commercial benefits. However, some sensitive patterns with security considerations may compromise privacy. In this paper, we aim to determine appropriate balance between need for privacy and information discovery in frequent patterns. A novel method to modify databases for hiding sensitive patterns is proposed in this paper. Multiplying the original database by a sanitization matrix yields a sanitized database with private content. In addition, two probabilities are introduced to oppose against the recovery of sensitive patterns and to reduce the degree of hiding non-sensitive patterns in the sanitized database. The complexity analysis and the security discussion of the proposed sanitization process are provided. The results from a series of experiments performed to show the efficiency and effectiveness of this approach are described.