Approximate algorithms for K-anonymity

Authors:
Hyoungmin Park;Kyuseok Shim
Affiliations:
Seoul National University, Seoul, South Korea;Seoul National University, Seoul, South Korea
Venue:
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Year:
2007

Citing 12
Cited 38

Generalizing data to provide anonymity when disclosing information (abstract)

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Introduction to algorithms

Introduction to algorithms
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
k-anonymity: a model for protecting privacy

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
Data Privacy through Optimal k-Anonymization

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
On the complexity of optimal K-anonymity

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Incognito: efficient full-domain K-anonymity

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
On k-anonymity and the curse of dimensionality

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Mondrian Multidimensional K-Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
\ell -Diversity: Privacy Beyond \kappa -Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Anonymizing tables

ICDT'05 Proceedings of the 10th international conference on Database Theory

Dynamic anonymization: accurate statistical analysis with privacy preservation

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Preservation of proximity privacy in publishing numerical sensitive data

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
The cost of privacy: destruction of data-mining utility in anonymized data publishing

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Privacy-preserving anonymization of set-valued data

Proceedings of the VLDB Endowment
Table summarization with the help of domain lattices

Proceedings of the 17th ACM conference on Information and knowledge management
AlphaSum: size-constrained table summarization using value lattices

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
A framework for efficient data anonymization under privacy and accuracy constraints

ACM Transactions on Database Systems (TODS)
On the Approximability of Geometric and Geographic Generalization and the Min-Max Bin Covering Problem

WADS '09 Proceedings of the 11th International Symposium on Algorithms and Data Structures
Privacy-Preserving Data Publishing

Foundations and Trends in Databases
StarClique: guaranteeing user privacy in social networks against intersection attacks

Proceedings of the 5th international conference on Emerging networking experiments and technologies
Transparent anonymization: Thwarting adversaries who know the algorithm

ACM Transactions on Database Systems (TODS)
The hardness and approximation algorithms for l-diversity

Proceedings of the 13th International Conference on Extending Database Technology
Algorithm-safe privacy-preserving data publishing

Proceedings of the 13th International Conference on Extending Database Technology
A practice-oriented framework for measuring privacy and utility in data sanitization systems

Proceedings of the 2010 EDBT/ICDT Workshops
The k-anonymity problem is hard

FCT'09 Proceedings of the 17th international conference on Fundamentals of computation theory
Efficient Anonymizations with Enhanced Utility

Transactions on Data Privacy
Clustering with diversity

ICALP'10 Proceedings of the 37th international colloquium conference on Automata, languages and programming
Resolving the complexity of some data privacy problems

ICALP'10 Proceedings of the 37th international colloquium conference on Automata, languages and programming: Part II
Theory of privacy and anonymity

Algorithms and theory of computation handbook
Extending l-diversity to generalize sensitive data

Data & Knowledge Engineering
Output privacy in data mining

ACM Transactions on Database Systems (TODS)
Local and global recoding methods for anonymizing set-valued data

The VLDB Journal — The International Journal on Very Large Data Bases
Anonymizing Set-Valued Social Data

GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
ASAP: Eliminating algorithm-based disclosure in privacy-preserving data publishing

Information Systems
Parameterized complexity of k-anonymity: hardness and tractability

IWOCA'10 Proceedings of the 21st international conference on Combinatorial algorithms
Anonymizing shortest paths on social network graphs

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Anonymizing binary and small tables is hard to approximate

Journal of Combinatorial Optimization
On the complexity of the l-diversity problem

MFCS'11 Proceedings of the 36th international conference on Mathematical foundations of computer science
On t-closeness with KL-divergence and semantic privacy

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
Limiting disclosure of sensitive data in sequential releases of databases

Information Sciences: an International Journal
Secure distributed computation of anonymized views of shared databases

ACM Transactions on Database Systems (TODS)
A practical approximation algorithm for optimal k-anonymity

Data Mining and Knowledge Discovery
k-Concealment: An Alternative Model of k-Type Anonymity

Transactions on Data Privacy
Trading privacy for information loss in the blink of an eye

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
k-subscription: privacy-preserving microblogging browsing through obfuscation

Proceedings of the 29th Annual Computer Security Applications Conference
The effect of homogeneity on the computational complexity of combinatorial data anonymization

Data Mining and Knowledge Discovery
The l-Diversity problem: Tractability and approximability

Theoretical Computer Science
K-anonymous path privacy on social graphs

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

When a table containing individual data is published, disclosure of sensitive information should be prohibitive. A naive approach for the problem is to remove identifiers such as name and social security number. However, linking attacks which joins the published table with other tables on some attributes, called quasi-identifier, may reveal the sensitive information. To protect privacy against linking attack, the notion of k-anonymity which makes each record in the table be indistinguishable with k-1 other records has been proposed previously. It is shown to be NP-Hard to k-anonymize a table minimizing the number of suppressed cells. To alleviate this, O(k log k)-approximation and O(k)-approximation algorithms were proposed in previous works. In this paper, we propose several approximation algorithms that guarantee O(log k)-approximation ratio and perform significantly better than the traditional algorithms. We also provide O(ß log k)-approximate algorithms which gracefully adjust their running time according to the tolerance é (≥ 1) of the approximation ratios. Experimental results confirm that our approximation algorithms perform significantly better than traditional approximation algorithms.