Privacy preservation by disassociation

Authors:
Manolis Terrovitis;Nikos Mamoulis;John Liagouris;Spiros Skiadopoulos
Affiliations:
IMIS, Research Center 'Athena';Univ. of Hong Kong;NTUA;Univ. of Peloponnese
Venue:
Proceedings of the VLDB Endowment
Year:
2012


Abstract

In this work, we focus on protection against identity disclosure in the publication of sparse multidimensional data. Existing multidimensional anonymization techniques (a) protect the privacy of users either by altering the set of quasi-identifiers of the original data (e.g., by generalization or suppression) or by adding noise (e.g., using differential privacy), and/or (b) assume a clear distinction between sensitive and non-sensitive information and sever the possible linkage between the two. In many real-world applications, these techniques are not applicable. For instance, consider web search query logs. Anonymization methods based on suppression or generalization would remove the most valuable information in the dataset: the original query terms. Additionally, web search query logs contain millions of query terms that cannot be categorized as sensitive or non-sensitive, since a term may be sensitive for one user and non-sensitive for another. Motivated by this observation, we propose an anonymization technique termed disassociation that preserves the original terms but hides the fact that two or more different terms appear in the same record. We protect the users' privacy by disassociating record terms that participate in identifying combinations. This way, the adversary cannot, with high probability, associate a record with a rare combination of terms. To the best of our knowledge, our proposal is the first to employ such a technique to provide protection against identity disclosure. We propose an anonymization algorithm based on our approach and evaluate its performance on real and synthetic datasets, comparing it against other state-of-the-art methods based on generalization and differential privacy.
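
The core idea of disassociation, keeping the original terms while hiding which terms co-occur in a record, can be illustrated with a small sketch. The code below is not the algorithm proposed in the paper. It is a simplified greedy split written for illustration, and the function name `disassociate`, the parameters `k` and `m`, and the sample query terms are all assumptions. The rule it applies is that terms are published together in a chunk only if every combination of up to `m` of them occurs in at least `k` records. Terms that occur in fewer than `k` records are published on their own, detached from any record.

```python
from collections import Counter
from itertools import combinations


def disassociate(records, k=2, m=2):
    """Toy split of a group of records into chunks (illustrative only).

    Terms are visited from most to least frequent. A term joins the current
    chunk only if every combination of up to m chunk terms still occurs in
    at least k records. Terms with support below k are returned separately.
    """
    support = Counter(t for r in records for t in set(r))
    ordered = sorted(support, key=lambda t: (-support[t], t))
    leftover = sorted(t for t in ordered if support[t] < k)
    remaining = [t for t in ordered if support[t] >= k]

    def is_safe(terms):
        # Project each record onto `terms` and count all combinations
        # of size 1..m that the projection would expose.
        counts = Counter()
        for r in records:
            proj = sorted(set(r) & set(terms))
            for size in range(1, min(m, len(proj)) + 1):
                counts.update(combinations(proj, size))
        return all(c >= k for c in counts.values())

    chunks = []
    while remaining:
        chunk_terms = []
        for t in remaining:
            if is_safe(chunk_terms + [t]):
                chunk_terms.append(t)
        remaining = [t for t in remaining if t not in chunk_terms]
        # Sorting the projections discards the original record order, so
        # rows of different chunks can no longer be matched by position.
        chunks.append(sorted(
            tuple(sorted(set(r) & set(chunk_terms)))
            for r in records
            if set(r) & set(chunk_terms)
        ))
    return chunks, leftover


# Hypothetical search-log records, each a set of query terms.
records = [
    {"madonna", "flu"},
    {"madonna", "flu"},
    {"madonna", "ikea"},
    {"ikea"},
    {"flu", "viagra"},
]
chunks, leftover = disassociate(records, k=2, m=2)
print(chunks)
print(leftover)
```

In this example the pair (ikea, madonna) occurs in only one record, so "ikea" is placed in a second chunk and the rare combination is no longer visible, while every original term is still published.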