A theoretical basis for perturbation methods

  • Authors:
  • Krishnamurty Muralidhar;Rathindra Sarathy

  • Affiliations:
  • School of Management, Gatton College of Business & Economics, University of Kentucky, Lexington, KY 40506-0034, USA. krishm@pop.uky.edu;Department of Management Science & Information Systems, Oklahoma State University, Stillwater, OK 74078, USA. sarathy@okstate.edu

  • Venue:
  • Statistics and Computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we discuss a new theoretical basis for perturbation methods. In developing this new theoretical basis, we define the ideal measures of data utility and disclosure risk. Maximum data utility is achieved when the statistical characteristics of the perturbed data are the same as that of the original data. Disclosure risk is minimized if providing users with microdata access does not result in any additional information. We show that when the perturbed values of the confidential variables are generated as independent realizations from the distribution of the confidential variables conditioned on the non-confidential variables, they satisfy the data utility and disclosure risk requirements. We also discuss the relationship between the theoretical basis and some commonly used methods for generating perturbed values of confidential numerical variables.