Towards low-perturbation anonymity preserving pattern discovery

  • Authors:
  • Maurizio Atzori;Francesco Bonchi;Fosca Giannott;Dino Pedreschi

  • Affiliations:
  • ISTI - CNR, Area della Ricerca di Pisa, Pisa, Italy and University of Pisa, Largo Pontecorvo, Pisa, Italy;ISTI - CNR, Area della Ricerca di Pisa, Pisa, Italy;ISTI - CNR, Area della Ricerca di Pisa, Pisa, Italy;University of Pisa, Largo Pontecorvo, Pisa, Italy

  • Venue:
  • Proceedings of the 2006 ACM symposium on Applied computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

It is generally believed that data mining results do not violate the anonymity of the individuals recorded in the source database. In fact, data mining models and patterns, in order to ensure a required statistical significance, represent a large number of individuals and thus conceal individual identities: this is the case of the minimum support threshold in association rule mining. We have recently shown [3], that the above belief is ill-founded: by shifting the concept of k-anonymity [8] from data to patterns, we have formally characterized the notion of a threat to anonymity in the context of frequent itemsets mining, and provided a methodology to efficiently and effectively identify such threats that might arise from the disclosure of a set of frequent itemsets. In our previous paper [2] we have introduced a first, naïve strategy (named suppressive) to sanitize such threats. In this paper we develop a novel sanitization strategy, named additive, which outperforms the previous one in terms of the introduced distortion and has the interesting feature of maintaining the original set of frequent itemsets unchanged, while modifying only the corresponding support values.