Measuring risk and utility of anonymized data using information theory

  • Authors:
  • Josep Domingo-Ferrer;David Rebollo-Monedero

  • Affiliations:
  • Universitat Rovira i Virgili, Tarragona, Catalonia;Technical University of Catalonia, Barcelona, Catalonia

  • Venue:
  • Proceedings of the 2009 EDBT/ICDT Workshops
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Before releasing anonymized microdata (individual data) it is essential to evaluate whether: i) their utility is high enough for their release to make sense; ii) the risk that the anonymized data result in disclosure of respondent identity or respondent attribute values is low enough. Utility and disclosure risk measures are used for the above evaluation, which normally lack a common theoretical framework allowing to trade off utility and risk in a consistent way. We explore in this paper the use of information-theoretic measures based on the notion of mutual information.