Differential privacy with compression

  • Authors:
  • Shuheng Zhou;Katrina Ligett;Larry Wasserman

  • Affiliations:
  • Seminar für Statistik, ETH Zürich, Zürich, Switzerland;Computer Science Department, Carnegie Mellon University, Pittsburgh, PA;Department of Statistics, Carnegie Mellon University, Pittsburgh, PA

  • Venue:
  • ISIT'09 Proceedings of the 2009 IEEE international conference on Symposium on Information Theory - Volume 4
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work studies formal utility and privacy guarantees for a simple multiplicative database transformation, where the data are compressed by a random linear or affine transformation, reducing the number of data records substantially, while preserving the number of original input variables. We provide an analysis framework inspired by a recent concept known as differential privacy. Our goal is to show that, despite the general difficulty of achieving the differential privacy guarantee, it is possible to publish synthetic data that are useful for a number of common statistical learning applications. This includes high dimensional sparse regression [24], principal component analysis (peA), and other statistical measures [16] based on the covariance of the initial data.