Using the jackknife method to produce safe plots of microdata

  • Authors:
  • Jobst Heitzig

  • Affiliations:
  • Federal Statistical Office Germany, Wiesbaden, Germany

  • Venue:
  • PSD'06 Proceedings of the 2006 CENEX-SDC project international conference on Privacy in Statistical Databases
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We discuss several methods for producing plots of uni- and bivariate distributions of confidential numeric microdata so that no single value is disclosed even in the presence of detailed additional knowledge, using the jackknife method of confidentiality protection. For histograms (as for frequency tables) this is similar to adding white noise of constant amplitude to all frequencies. Decreasing the bin size and smoothing, leading to kernel density estimation in the limit, gives more informative plots which need less noise for protection. Detail can be increased by choosing the bandwidth locally. Smoothing also the noise (i.e. using correlated noise) gives more visual improvement. Additional protection comes from robustifying the kernel density estimator or plotting only classified densities as in contour plots.