Utility-driven anonymization in data publishing

  • Authors:
  • Mingqiang Xue;Panagiotis Karras;Chedy Raïssi;Hung Keng Pung

  • Affiliations:
  • National University of Singapore, Singapore, Singapore;Rutgers University, Newark, NJ, USA;INRIA, Nancy Grand-Est, France;National University of Singapore, Singapore, Singapore

  • Venue:
  • Proceedings of the 20th ACM international conference on Information and knowledge management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Privacy-preserving data publication has been studied intensely in the past years. Still, all existing approaches transform data values by random perturbation or generalization. In this paper, we introduce a radically different data anonymization methodology. Our proposal aims to maintain a certain amount of patterns, defined in terms of a set of properties of interest that hold for the original data. Such properties are represented as linear relationships among data points. We present an algorithm that generates a set of anonymized data that strictly preserves these properties, thus maintaining specified patterns in the data. Extensive experiments with real and synthetic data show that our algorithm is efficient, and produces anonymized data that affords high utility in several data analysis tasks while safeguarding privacy.