Anonymity preserving pattern discovery

  • Authors:
  • Maurizio Atzori, Francesco Bonchi, Fosca Giannotti, Dino Pedreschi

  • Affiliations:
  • Pisa KDD Laboratory, ISTI-CNR, Pisa, Italy 56124 and Pisa KDD Laboratory, Computer Science Department, University of Pisa, Pisa, Italy 56127; Pisa KDD Laboratory, ISTI-CNR, Pisa, Italy 56124; Pisa KDD Laboratory, ISTI-CNR, Pisa, Italy 56124; Pisa KDD Laboratory, Computer Science Department, University of Pisa, Pisa, Italy 56127

  • Venue:
  • The VLDB Journal — The International Journal on Very Large Data Bases
  • Year:
  • 2008

Abstract

It is generally believed that data mining results do not violate the anonymity of the individuals recorded in the source database. Indeed, data mining models and patterns, in order to achieve the required statistical significance, represent a large number of individuals and thus appear to conceal individual identities: the minimum support threshold in frequent pattern mining is a case in point. In this paper we show that this belief is ill-founded. By shifting the concept of k-anonymity from the source data to the extracted patterns, we formally characterize the notion of a threat to anonymity in the context of pattern discovery, and provide a methodology to efficiently and effectively identify all such potential threats arising from the disclosure of the set of extracted patterns. On this basis, we obtain a formal notion of privacy protection that allows the extracted knowledge to be disclosed while protecting the anonymity of the individuals in the source database. Moreover, for the cases where threats to anonymity cannot be avoided, we study how to eliminate them by means of pattern (not data!) distortion performed in a controlled way.
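
To make the kind of threat discussed in the abstract concrete, the following is a minimal illustrative sketch, not the paper's algorithm: combining the supports of two disclosed frequent itemsets by inclusion-exclusion can pinpoint a group of fewer than k individuals (those matching one pattern but not its extension), even though each released pattern on its own exceeds the minimum support threshold. All item names, support values, and the threshold k below are hypothetical.

    # Hypothetical sketch of an inference channel from disclosed pattern supports.
    from itertools import combinations

    K = 3  # anonymity threshold: an inferable group of 1..K-1 individuals is a threat

    # Supports of disclosed frequent itemsets (illustrative values); each pattern
    # alone covers far more than K individuals, so each looks "safe" in isolation.
    supports = {
        frozenset({"a1"}): 1000,
        frozenset({"a1", "a2"}): 998,
    }

    def support_with_negations(base, superset, supports):
        """Number of transactions satisfying `base` but none of the extra items
        in `superset`, computed by inclusion-exclusion over disclosed supports.
        Returns None if a needed support was not disclosed."""
        extra = superset - base
        total = 0
        for r in range(len(extra) + 1):
            for subset in combinations(extra, r):
                itemset = base | frozenset(subset)
                if itemset not in supports:
                    return None
                total += (-1) ** r * supports[itemset]
        return total

    def find_threats(supports, k):
        """Return (base, superset, count) triples where the disclosed patterns
        identify a non-empty group of fewer than k individuals."""
        threats = []
        for base in supports:
            for superset in supports:
                if base < superset:
                    count = support_with_negations(base, superset, supports)
                    if count is not None and 0 < count < k:
                        threats.append((set(base), set(superset), count))
        return threats

    print(find_threats(supports, K))
    # With the values above this reports a group of size 2: exactly two
    # individuals satisfy a1 but not a2, an anonymity threat for k = 3.

In this toy example, neither released support is sensitive by itself; the threat arises only from their combination, which is why the paper argues for checking (and, if needed, distorting) the pattern set as a whole rather than the individual patterns.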