Information disclosure under realistic assumptions: privacy versus optimality

Authors:
Lei Zhang;Sushil Jajodia;Alexander Brodsky
Affiliations:
George Mason University, Fairfax, VA;George Mason University, Fairfax, VA & The MITRE Corporation, Mclean, VA;George Mason University, Fairfax, VA
Venue:
Proceedings of the 14th ACM conference on Computer and communications security
Year:
2007

Citing 15
Cited 20

Security problems on inference control for SUM, MAX, and MIN queries

Journal of the ACM (JACM)
Security-control methods for statistical databases: a comparative study

ACM Computing Surveys (CSUR)
Secure databases: protection against user influence

ACM Transactions on Database Systems (TODS)
Auditing Boolean attributes

PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Protecting Respondents' Identities in Microdata Release

IEEE Transactions on Knowledge and Data Engineering
k-anonymity: a model for protecting privacy

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
A formal analysis of information disclosure in data exchange

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Data Privacy through Optimal k-Anonymization

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
On the complexity of optimal K-anonymity

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Simulatable auditing

Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Incognito: efficient full-domain K-anonymity

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
\ell -Diversity: Privacy Beyond \kappa -Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Injecting utility into anonymized datasets

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Personalized privacy preservation

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Toward privacy in public databases

TCC'05 Proceedings of the Second international conference on Theory of Cryptography

The cost of privacy: destruction of data-mining utility in anonymized data publishing

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Exclusive Strategy for Generalization Algorithms in Micro-data Disclosure

Proceeedings of the 22nd annual IFIP WG 11.3 working conference on Data and Applications Security
Anonymization-based attacks in privacy-preserving data publishing

ACM Transactions on Database Systems (TODS)
L-Cover: Preserving Diversity by Anonymity

SDM '09 Proceedings of the 6th VLDB Workshop on Secure Data Management
Privacy-Preserving Data Publishing

Foundations and Trends in Databases
Transparent anonymization: Thwarting adversaries who know the algorithm

ACM Transactions on Database Systems (TODS)
Algorithm-safe privacy-preserving data publishing

Proceedings of the 13th International Conference on Extending Database Technology
On the use of economic price theory to find the optimum levels of privacy and information utility in non-perturbative microdata anonymisation

Data & Knowledge Engineering
Restoring compromised privacy in micro-data disclosure

ASIACCS '10 Proceedings of the 5th ACM Symposium on Information, Computer and Communications Security
k-jump strategy for preserving privacy in micro-data disclosure

Proceedings of the 13th International Conference on Database Theory
Versatile publishing for privacy preservation

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Airavat: security and privacy for MapReduce

NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
Understanding privacy risk of publishing decision trees

DBSec'10 Proceedings of the 24th annual IFIP WG 11.3 working conference on Data and applications security and privacy
Minimizing minimality and maximizing utility: analyzing method-based attacks on anonymized data

Proceedings of the VLDB Endowment
ASAP: Eliminating algorithm-based disclosure in privacy-preserving data publishing

Information Systems
Differentially private data release for data mining

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Privacy streamliner: a two-stage approach to improving algorithm efficiency

Proceedings of the second ACM conference on Data and Application Security and Privacy
An information theoretic privacy and utility measure for data sanitization mechanisms

Proceedings of the second ACM conference on Data and Application Security and Privacy
k-Concealment: An Alternative Model of k-Type Anonymity

Transactions on Data Privacy
Secure distributed framework for achieving ε-differential privacy

PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

The problem of information disclosure has attracted much interest from the research community in recent years. When disclosing information, the challenge is to provide as much information as possible (optimality) while guaranteeing a desired safety property for privacy (such as l-diversity). A typical disclosure algorithm uses a sequence of disclosure schemas to output generalizations in the nonincreasing order of data utility; the algorithm releases the first generalization that satisfies the safety property. In this paper, we assert that the desired safety property cannot always be guaranteed if an adversary has the knowledge of the underlying disclosure algorithm. We propose a model for the additional information disclosed by an algorithm based on the definition of deterministic disclosure function (DDF), and provide definitions of p-safe and p-optimal DDFs. We give an analysis for the complexity to compute a p-optimal DDF. We show that deciding whether a DDF is p-optimal is an NP-hard problem, and only under specific conditions, we can solve the problem in polynomial time with respect to the size of the set of all possible database instances and the length of the disclosure generalization sequence. We then consider the problem of microdata disclosure and the safety condition of l-diversity. We relax the notion of p-optimality to weak p-optimality, and develop a weak p-optimal algorithm which is polynomial in the size of the original table and the length of the generalization sequence.