Distributed privacy preserving data collection

Authors:
Mingqiang Xue;Panagiotis Papadimitriou;Chedy Raïssi;Panos Kalnis;Hung Keng Pung
Affiliations:
Computer Science Department, National University of Singapore;Stanford University;INRIA Nancy;King Abdullah University of Science and Technology;Computer Science Department, National University of Singapore
Venue:
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Year:
2011

Abstract

We study the distributed privacy-preserving data collection problem: an untrusted data collector (e.g., a medical research institute) wishes to collect data (e.g., medical records) from a group of respondents (e.g., patients). Each respondent owns a multi-attribute record that contains both non-sensitive information (e.g., quasi-identifiers) and sensitive information (e.g., a particular disease), and submits it to the data collector. Let T be the table formed by all the respondents' records. We say that the data collection process is privacy preserving if it allows the data collector to obtain a k-anonymized or l-diversified version of T without revealing the original records to the adversary. We propose a distributed data collection protocol that outputs an anonymized table by generalizing the quasi-identifier attributes. The protocol employs cryptographic techniques such as homomorphic encryption, private information retrieval, and secure multiparty computation to ensure the privacy goal during data collection. Meanwhile, the protocol is designed to leak limited but noncritical information in order to remain practical and efficient. Experiments show that the utility of the anonymized table produced by our protocol is on par with the utility achieved by traditional anonymization techniques.
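
To make the privacy goal concrete, the following is a minimal, centralized sketch of what "a k-anonymized version of T obtained by generalization of quasi-identifier attributes" means. It is not the distributed cryptographic protocol described in the abstract. The record layout (age, ZIP code, disease), the function names generalize and is_k_anonymous, and the bucket sizes are all hypothetical choices made for illustration only.

```python
from collections import Counter


def generalize(record, age_bucket=10, zip_digits=3):
    """Coarsen the quasi-identifiers of one (age, zip_code, disease) record.

    Hypothetical scheme: ages become fixed-width ranges and ZIP codes keep
    only their leading digits. The sensitive value is left untouched.
    """
    age, zip_code, disease = record
    low = (age // age_bucket) * age_bucket
    age_range = f"{low}-{low + age_bucket - 1}"
    zip_prefix = zip_code[:zip_digits] + "*" * (len(zip_code) - zip_digits)
    return (age_range, zip_prefix, disease)


def is_k_anonymous(table, k):
    """Return True if every quasi-identifier combination occurs at least k times."""
    groups = Counter((age, zip_code) for age, zip_code, _ in table)
    return all(count >= k for count in groups.values())


# Toy table T: (age, ZIP code, disease); the values are made up.
records = [
    (23, "53711", "flu"),
    (27, "53715", "asthma"),
    (34, "53703", "flu"),
    (38, "53706", "diabetes"),
]

anonymized = [generalize(r) for r in records]

print(is_k_anonymous(records, 2))      # raw quasi-identifiers are all unique
print(is_k_anonymous(anonymized, 2))   # each generalized group has two records
```

In this toy table every raw record has a unique (age, ZIP code) pair, so the table is not 2-anonymous. After generalization the records fall into two groups of two, so the generalized table is 2-anonymous. In the setting of the abstract, the collector would obtain only such a generalized table, and never the raw records.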