A Practical Differentially Private Random Decision Tree Classifier

  • Authors:
  • Geetha Jagannathan;Krishnan Pillaipakkamnatt;Rebecca N. Wright

  • Affiliations:
  • Department of Computer Science/ Columbia University/ NY/ USA. e-mail: geetha@cs.columbia.edu;Department of Computer Science/ Hofstra University/ Hempstead/ NY/ USA. e-mail: csckzp@hofstra.edu;Department of Computer Science/ Rutgers University/ New Brunswick/ NJ/ USA. e-mail: rebecca.wright@rutgers.edu

  • Venue:
  • Transactions on Data Privacy
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we study the problem of constructing private classifiers using decision trees, within the framework of differential privacy. We first present experimental evidence that creating a differentially private ID3 tree using differentially private low-level queries does not simultaneously provide good privacy and good accuracy, particularly for small datasets. In search of better privacy and accuracy, we then present a differentially private decision tree ensemble algorithm based on random decision trees. We demonstrate experimentally that this approach yields good prediction while maintaining good privacy, even for small datasets. We also present differentially private extensions of our algorithm to two settings: (1) new data is periodically appended to an existing database and (2) the database is horizontally or vertically partitioned between multiple users.