k-Anonymous Decision Tree Induction

  • Authors:
  • Arik Friedman;Assaf Schuster;Ran Wolff

  • Affiliations:
  • Computer Science Dept., Technion – Israel Institute of Technology;Computer Science Dept., Technion – Israel Institute of Technology;Computer Science Dept., Technion – Israel Institute of Technology

  • Venue:
  • PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we explore an approach to privacy preserving data mining that relies on the k-anonymity model. The k-anonymity model guarantees that no private information in a table can be linked to a group of less than k individuals. We suggest extended definitions of k-anonymity that allow the k-anonymity of a data mining model to be determined. Using these definitions, we present decision tree induction algorithms that are guaranteed to maintain k-anonymity of the learning examples. Experiments show that embedding anonymization within the decision tree induction process provides better accuracy than anonymizing the data first and inducing the tree later.