Determining error bounds for spectral filtering based reconstruction methods in privacy preserving data mining

  • Authors:
  • Songtao Guo;Xintao Wu;Yingjiu Li

  • Affiliations:
  • University of North Carolina at Charlotte, Department of Software and Information Systems, 28223, Charlotte, NC, USA;University of North Carolina at Charlotte, Department of Software and Information Systems, 28223, Charlotte, NC, USA;School of Information Systems, Singapore Management University, 28223, Singapore, Singapore

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Additive randomization has been a primary tool for hiding sensitive private information. Previous work empirically showed that individual data values can be approximately reconstructed from the perturbed values, using spectral filtering techniques. This poses a serious threat of privacy breaches. In this paper we conduct a theoretical study on how the reconstruction error varies, for different types of additive noise. In particular, we first derive an upper bound for the reconstruction error using matrix perturbation theory. Attackers who use spectral filtering techniques to estimate the true data values may leverage this bound to determine how close their estimates are to the original data. We then derive a lower bound for the reconstruction error, which can help data owners decide how much noise should be added to satisfy a given threshold of the tolerated privacy breach.