Privacy-preserving anonymization of set-valued data

Authors:
Manolis Terrovitis;Nikos Mamoulis;Panos Kalnis
Affiliations:
University of Hong Kong;University of Hong Kong;National University of Singapore
Venue:
Proceedings of the VLDB Endowment
Year:
2008

Citing 19
Cited 38

Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Real world performance of association rule algorithms

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Protecting Respondents' Identities in Microdata Release

IEEE Transactions on Knowledge and Data Engineering
k-anonymity: a model for protecting privacy

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
Transforming data to satisfy privacy constraints

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Association Rule Hiding

IEEE Transactions on Knowledge and Data Engineering
Data Privacy through Optimal k-Anonymization

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
On the complexity of optimal K-anonymity

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Incognito: efficient full-domain K-anonymity

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Mondrian Multidimensional K-Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
\ell -Diversity: Privacy Beyond \kappa -Anonymity

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Achieving anonymity via clustering

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Utility-based anonymization using local recoding

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Anatomy: simple and effective privacy preservation

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Approximate algorithms for K-anonymity

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Thoughts on k-anonymization

Data & Knowledge Engineering
Fast data anonymization with low information loss

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Anonymity preserving pattern discovery

The VLDB Journal — The International Journal on Very Large Data Bases
On the Anonymization of Sparse High-Dimensional Data

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering

Anonymizing healthcare data: a case study on the blood transfusion service

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Anonymized data: generation, models, usage

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Privacy-Preserving Data Publishing

Foundations and Trends in Databases
Walking in the crowd: anonymizing trajectory data for pattern analysis

Proceedings of the 18th ACM conference on Information and knowledge management
A framework for safely publishing communication traces

Proceedings of the 18th ACM conference on Information and knowledge management
Anonymization of set-valued data via top-down, local generalization

Proceedings of the VLDB Endowment
Privacy-preserving data publishing: A survey of recent developments

ACM Computing Surveys (CSUR)
The test data challenge for database-driven applications

Proceedings of the Third International Workshop on Testing Database Systems
Search-log anonymization and advertisement: are they mutually exclusive?

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Anonymizing transaction data to eliminate sensitive inferences

DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
ρ-uncertainty: inference-proof transaction anonymization

Proceedings of the VLDB Endowment
Ontology-based anonymization of categorical values

MDAI'10 Proceedings of the 7th international conference on Modeling decisions for artificial intelligence
Instant anonymization

ACM Transactions on Database Systems (TODS)
Local and global recoding methods for anonymizing set-valued data

The VLDB Journal — The International Journal on Very Large Data Bases
Inference control to protect sensitive information in text documents

ACM SIGKDD Workshop on Intelligence and Security Informatics
PCTA: privacy-constrained clustering-based transaction data anonymization

Proceedings of the 4th International Workshop on Privacy and Anonymity in the Information Society
Privacy-aware collection of aggregate spatial data

Data & Knowledge Engineering
Anonymous Search Histories Featuring Personalized Advertisement-Balancing Privacy with Economic Interests

Transactions on Data Privacy
C-safety: a framework for the anonymization of semantic trajectories

Transactions on Data Privacy
Revisiting sequential pattern hiding to enhance utility

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Protecting privacy in data release

Foundations of security analysis and design VI
Privacy preservation in the dissemination of location data

ACM SIGKDD Explorations Newsletter
A publication process model to enable privacy-aware data sharing

IBM Journal of Research and Development
Significance of Term Relationships on Anonymization

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Hiding emerging patterns with local recoding generalization

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Anonymizing transaction data by integrating suppression and generalization

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Utility-preserving transaction data anonymization with low information loss

Expert Systems with Applications: An International Journal
Institute for the management of information systems Athena research center

ACM SIGMOD Record
Utility-guided Clustering-based Transaction Data Anonymization

Transactions on Data Privacy
Privacy protection of textual attributes through a semantic-based masking method

Information Fusion
Privacy preservation by disassociation

Proceedings of the VLDB Endowment
PrivBasis: frequent itemset mining with differential privacy

Proceedings of the VLDB Endowment
Clustering-oriented privacy-preserving data publishing

Knowledge-Based Systems
t-Plausibility: Generalizing Words to Desensitize Text

Transactions on Data Privacy
On differentially private frequent itemset mining

Proceedings of the VLDB Endowment
Privacy-preserving trajectory data publishing by local suppression

Information Sciences: an International Journal
Using safety constraint for transactional dataset anonymization

DBSec'13 Proceedings of the 27th international conference on Data and Applications Security and Privacy XXVII
Efficient Time-Stamped Event Sequence Anonymization

ACM Transactions on the Web (TWEB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we study the problem of protecting privacy in the publication of set-valued data. Consider a collection of transactional data that contains detailed information about items bought together by individuals. Even after removing all personal characteristics of the buyer, which can serve as links to his identity, the publication of such data is still subject to privacy attacks from adversaries who have partial knowledge about the set. Unlike most previous works, we do not distinguish data as sensitive and non-sensitive, but we consider them both as potential quasi-identifiers and potential sensitive data, depending on the point of view of the adversary. We define a new version of the k-anonymity guarantee, the km-anonymity, to limit the effects of the data dimensionality and we propose efficient algorithms to transform the database. Our anonymization model relies on generalization instead of suppression, which is the most common practice in related works on such data. We develop an algorithm which finds the optimal solution, however, at a high cost which makes it inapplicable for large, realistic problems. Then, we propose two greedy heuristics, which scale much better and in most of the cases find a solution close to the optimal. The proposed algorithms are experimentally evaluated using real datasets.