Efficient Anonymizations with Enhanced Utility

Authors:
Jacob Goldberger;Tamir Tassa
Affiliations:
School of Engineering/ Bar-Ilan University/ Ramat-Gan/ Israel. e-mail: goldbej@eng.biu.ac.il;Division of Computer Science/ The Open University/ Ra'anana/ Israel. e-mail: tamirta@openu.ac.il
Venue:
Transactions on Data Privacy
Year:
2010

Citing 22
Cited 5

Elements of information theory

Elements of information theory
Generalizing data to provide anonymity when disclosing information (abstract)

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Privacy-preserving data mining

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Unsupervised document classification using sequential information maximization

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Analysis of the Clustering Properties of the Hilbert Space-Filling Curve

IEEE Transactions on Knowledge and Data Engineering
Transforming data to satisfy privacy constraints

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Data Privacy through Optimal k-Anonymization

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
On the complexity of optimal K-anonymity

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Incognito: efficient full-domain K-anonymity

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Mondrian Multidimensional K-Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
\ell -Diversity: Privacy Beyond \kappa -Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Thoughts on k-Anonymization

ICDEW '06 Proceedings of the 22nd International Conference on Data Engineering Workshops
Achieving anonymity via clustering

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Injecting utility into anonymized datasets

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
(α, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Anatomy: simple and effective privacy preservation

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Approximate algorithms for K-anonymity

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
A Critique of k-Anonymity and Some of Its Enhancements

ARES '08 Proceedings of the 2008 Third International Conference on Availability, Reliability and Security
k-Anonymization with Minimal Loss of Information

IEEE Transactions on Knowledge and Data Engineering
A framework for efficient data anonymization under privacy and accuracy constraints

ACM Transactions on Database Systems (TODS)
k-Anonymization Revisited

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Efficient k-anonymization using clustering techniques

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications

Limiting disclosure of sensitive data in sequential releases of databases

Information Sciences: an International Journal
Secure distributed computation of anonymized views of shared databases

ACM Transactions on Database Systems (TODS)
A practical approximation algorithm for optimal k-anonymity

Data Mining and Knowledge Discovery
k-Concealment: An Alternative Model of k-Type Anonymity

Transactions on Data Privacy
Improving accuracy of classification models induced from anonymized datasets

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

One of the most well studied models of privacy preservation is k-anonymity. Previous studies of k-anonymization used various utility measures that aim at enhancing the correlation between the original public data and the generalized public data. We, bearing in mind that a primary goal in releasing the anonymized database for datamining is to deducemethods of predicting the private data from the public data, propose a new information-theoretic measure that aims at enhancing the correlation between the generalized public data and the private data. Such a measure significantly enhances the utility of the released anonymized database for data mining. We then proceed to describe a new algorithm that is designed to achieve k-anonymity with high utility, independently of the underlying utility measure. That algorithm is based on a modified version of sequential clustering which is the method of choice in clustering. Experimental comparison with four well known algorithms of k-anonymity show that the sequential clustering algorithm is an efficient algorithm that achieves the best utility results. We also describe a modification of the algorithm that outputs k-anonymizations which respect the additional security measure of l-diversity.