Providing k-anonymity in data mining

  • Authors:
  • Arik Friedman; Ran Wolff; Assaf Schuster

  • Affiliations:
  • Computer Science Department, Technion, Israel Institute of Technology, Haifa, Israel; Computer Science Department, Technion, Israel Institute of Technology, Haifa, Israel; Management Information Systems Department, Haifa University, Haifa, Israel

  • Venue:
  • The VLDB Journal — The International Journal on Very Large Data Bases
  • Year:
  • 2008

Abstract

In this paper we present extended definitions of k-anonymity and use them to prove that a given data mining model does not violate the k-anonymity of the individuals represented in the learning examples. Our extension provides a tool that measures the amount of anonymity retained during data mining. We show that our model can be applied to various data mining problems, such as classification, association rule mining and clustering. We describe two data mining algorithms which exploit our extension to guarantee they will generate only k-anonymous output, and provide experimental results for one of them. Finally, we show that our method contributes new and efficient ways to anonymize data and preserve patterns during anonymization.
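
For context, the classical table-level notion of k-anonymity that the abstract takes as its starting point can be sketched as below. This is a minimal illustration of the standard definition (every combination of quasi-identifier values is shared by at least k records), not the extended model-level definitions the paper introduces. The function names, field names and sample records are hypothetical and chosen only for illustration.

```python
from collections import Counter


def group_sizes(records, quasi_identifiers):
    """Count how many records share each combination of quasi-identifier values."""
    return Counter(tuple(r[q] for q in quasi_identifiers) for r in records)


def is_k_anonymous(records, quasi_identifiers, k):
    """True if every quasi-identifier combination occurs in at least k records.

    An empty table is treated as trivially k-anonymous (an assumption of this sketch).
    """
    return all(n >= k for n in group_sizes(records, quasi_identifiers).values())


# Hypothetical, already-generalized sample table.
records = [
    {"zip": "130**", "age": "20-29", "diagnosis": "flu"},
    {"zip": "130**", "age": "20-29", "diagnosis": "asthma"},
    {"zip": "148**", "age": "30-39", "diagnosis": "flu"},
    {"zip": "148**", "age": "30-39", "diagnosis": "diabetes"},
    {"zip": "148**", "age": "30-39", "diagnosis": "flu"},
]
quasi_identifiers = ["zip", "age"]

sizes = group_sizes(records, quasi_identifiers)
smallest_group = min(sizes.values())
print(smallest_group)
print(is_k_anonymous(records, quasi_identifiers, 2))
print(is_k_anonymous(records, quasi_identifiers, 3))
```

The size of the smallest group is the largest k for which the table is k-anonymous. The paper's contribution, per the abstract, is to carry this kind of guarantee over from released tables to the output of a data mining model, which this sketch does not attempt.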