Privacy preserving frequent itemset mining

  • Authors:
  • Stanley R. M. Oliveira;Osmar R. Zaïane

  • Affiliations:
  • Embrapa Information Technology, André Tosello, 209 - Barão Geraldo, 13083-886 - Campinas, SP, Brasil and Department of Computing Science, University of Alberta, Edmonton, AB, Canada, T6G ...;Department of Computing Science, University of Alberta, Edmonton, AB, Canada, T6G 2E8

  • Venue:
  • CRPIT '14 Proceedings of the IEEE international conference on Privacy, security and data mining - Volume 14
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

One crucial aspect of privacy preserving frequent itemset mining is the fact that the mining process deals with a trade-off: privacy and accuracy, which are typically contradictory, and improving one usually incurs a cost in the other. One alternative to address this particular problem is to look for a balance between hiding restrictive patterns and disclosing nonrestrictive ones. In this paper, we propose a new framework for enforcing privacy in mining frequent itemsets. We combine, in a single framework, techniques for efficiently hiding restrictive patterns: a transaction retrieval engine relying on an inverted file and Boolean queries; and a set of algorithms to "sanitize" a database. In addition, we introduce performance measures for mining frequent itemsets that quantify the fraction of raining patterns which are preserved after sanitizing a database. We also report the results of a performance evaluation of our research prototype and an analysis of the results.