(α, k)-anonymity based privacy preservation by lossy join

  • Authors:
  • Raymond Chi-Wing Wong;Yubao Liu;Jian Yin;Zhilan Huang;Ada Wai-Chee Fu;Jian Pei

  • Affiliations:
  • Department of Computer Science and Engineering, the Chinese University of Hong Kong, Hong Kong;Department of Computer Science, Zhongshan University, China;Department of Computer Science, Zhongshan University, China;Department of Computer Science, Zhongshan University, China;Department of Computer Science and Engineering, the Chinese University of Hong Kong, Hong Kong;School of Computing Science, Simon Fraser University, Canada

  • Venue:
  • APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Privacy-preserving data publication for data mining is to protect sensitive information of individuals in published data while the distortion to the data is minimized. Recently, it is shown that (α, k)- anonymity is a feasible technique when we are given some sensitive attribute(s) and quasi-identifier attributes. In previous work, generalization of the given data table has been used for the anonymization. In this paper, we show that we can project the data onto two tables for publishing in such a way that the privacy protection for (α, k)-anonymity can be achieved with less distortion. In the two tables, one table contains the undisturbed non-sensitive values and the other table contains the undisturbed sensitive values. Privacy preservation is guaranteed by the lossy join property of the two tables. We show by experiments that the results are better than previous approaches.