Cloning for privacy protection in multiple independent data publications

Authors:
Muzammil M. Baig;Jiuyong Li;Jixue Liu;Hua Wang
Affiliations:
University of South Australia, Mawson Lakes, Australia;University of South Australia, Mawson Lakes, Australia;University of South Australia, Mawson Lakes, Australia;University of Southern Queensland, Toowoomba, Australia
Venue:
Proceedings of the 20th ACM international conference on Information and knowledge management
Year:
2011

Citing 32
Cited 2

k-anonymity: a model for protecting privacy

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
On the complexity of optimal K-anonymity

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Incognito: efficient full-domain K-anonymity

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Checking for k-anonymity violation by views

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Mondrian Multidimensional K-Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Personalized privacy preservation

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
(α, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A secure distributed framework for achieving k-anonymity

The VLDB Journal — The International Journal on Very Large Data Bases
Anatomy: simple and effective privacy preservation

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
L-diversity: Privacy beyond k-anonymity

ACM Transactions on Knowledge Discovery from Data (TKDD)
M-invariance: towards privacy preserving re-publication of dynamic datasets

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Hippocratic databases

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
k-Unlinkability: A privacy protection model for distributed data

Data & Knowledge Engineering
Minimality attack in privacy preserving data publishing

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Privacy skyline: privacy with multidimensional adversarial knowledge

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Anonymity for continuous data publishing

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
The cost of privacy: destruction of data-mining utility in anonymized data publishing

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Composition attacks and auxiliary information in data privacy

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Anonymization by Local Recoding in Data with Attribute Hierarchical Taxonomies

IEEE Transactions on Knowledge and Data Engineering
On Anti-Corruption Privacy Preserving Publication

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Attacks on privacy and deFinetti's theorem

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Non-homogeneous generalization in privacy preserving data publishing

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Versatile publishing for privacy preservation

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Collusion-resistant privacy-preserving data mining

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Data mining with differential privacy

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Does differential privacy protect terry gross' privacy?

PSD'10 Proceedings of the 2010 international conference on Privacy in statistical databases
SABRE: a Sensitive Attribute Bucketization and REdistribution framework for t-closeness

The VLDB Journal — The International Journal on Very Large Data Bases
Differential privacy

ICALP'06 Proceedings of the 33rd international conference on Automata, Languages and Programming - Volume Part II
Anonymizing tables

ICDT'05 Proceedings of the 10th international conference on Database Theory
Privacy-preserving distributed k-anonymity

DBSec'05 Proceedings of the 19th annual IFIP WG 11.3 working conference on Data and Applications Security
Calibrating noise to sensitivity in private data analysis

TCC'06 Proceedings of the Third conference on Theory of Cryptography
Studying genotype-phenotype attack on k-anonymised medical and genomic data

AusDM '09 Proceedings of the Eighth Australasian Data Mining Conference - Volume 101

The application of differential privacy to health data

Proceedings of the 2012 Joint EDBT/ICDT Workshops
A privacy framework: indistinguishable privacy

Proceedings of the Joint EDBT/ICDT 2013 Workshops

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data anonymization has become a major technique in privacy preserving data publishing. Many methods have been proposed to anonymize one dataset and a series of datasets of a data owner. However, no method has been proposed for the anonymization of data of multiple independent data publications. A data owner publishes a dataset, which contains overlapping population with other datasets published by other independent data owners. In this paper we analyze the privacy risk in the such scenario and vulnerability of partitioned based anonymization methods. We show that no partitioned based anonymization methods can protect privacy in arbitrary data distributions, and identify a case that the privacy can be protected in the scenario. We propose a new generalization principle ε-cloning to protect privacy for multiple independent data publications. We also develop an effective algorithm to achieve the ε-cloning. We experimentally show that the proposed algorithm anonymizes data to satisfy the privacy requirement and preserves good data utility.