A family of enhanced (L,α)-diversity models for privacy preserving data publishing

  • Authors:
  • Xiaoxun Sun;Min Li;Hua Wang

  • Affiliations:
  • Australian Council for Educational Research, Australia;Department of Mathematics & Computing, University of Southern Queensland, Australia;Department of Mathematics & Computing, University of Southern Queensland, Australia

  • Venue:
  • Future Generation Computer Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Privacy preservation is an important issue in the release of data for mining purposes. Recently, a novel l-diversity privacy model was proposed. However, even an l-diverse data set may have some severe problems leading to the revelation of individual sensitive information. In this paper, we remedy the problem by introducing distinct (l,@a)-diversity, which, intuitively, demands that the total weight of the sensitive values in a given QI-group is at least @a, where the weight is controlled by a pre-defined recursive metric system. We provide a thorough analysis of the distinct (l,@a)-diversity and prove that the optimal distinct (l,@a)-diversity problem with its two variants entropy (l,@a)-diversity and recursive (c,l,@a)-diversity are NP-hard, and propose a top-down anonymization approach to solve the distinct (l,@a)-diversity problem with its variants. We show in the extensive experimental evaluations that the proposed methods are practical in terms of utility measurements and can be implemented efficiently.