Towards publishing recommendation data with predictive anonymization

  • Authors: Chih-Cheng Chang, Brian Thompson, Hui (Wendy) Wang, Danfeng Yao
  • Affiliations: Rutgers University, Piscataway, NJ (Chang, Thompson, Yao); Stevens Institute of Technology, Hoboken, NJ (Wang)
  • Venue: ASIACCS '10: Proceedings of the 5th ACM Symposium on Information, Computer and Communications Security
  • Year: 2010

Abstract

Recommender systems are used to predict user preferences for products or services. To seek better prediction techniques, data owners of recommender systems such as Netflix sometimes make their customers' reviews available to the public, which raises serious privacy concerns. With only a small amount of knowledge about individuals and their ratings of some items in a recommender system, an adversary may easily re-identify users and breach their privacy. Unfortunately, most existing privacy models (e.g., k-anonymity) cannot be directly applied to recommender systems. In this paper, we study the problem of privacy-preserving publishing of recommendation datasets. We represent recommendation data as a bipartite graph and identify several attacks that can re-identify users and determine their item ratings. To counter these attacks, we first give formal privacy definitions for recommendation data, and then develop a robust and efficient anonymization algorithm, Predictive Anonymization, to achieve our privacy goals. Our experimental results show that Predictive Anonymization prevents these attacks with very little impact on prediction accuracy.
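
To make the threat described above concrete, here is a minimal, hypothetical sketch (not taken from the paper) of the bipartite user-item view of a rating dataset and a simple structural re-identification check. All names, ratings, and the reidentify helper are illustrative assumptions, not the authors' algorithm.

    from collections import defaultdict

    # Hypothetical toy ratings: (user, item, rating) triples standing in for
    # a published recommendation dataset such as the Netflix Prize data.
    ratings = [
        ("u1", "Movie A", 5), ("u1", "Movie B", 3),
        ("u2", "Movie A", 4), ("u2", "Movie C", 2),
        ("u3", "Movie B", 1), ("u3", "Movie C", 5),
    ]

    # Bipartite-graph view: users on one side, items on the other,
    # edges labeled with ratings.
    user_edges = defaultdict(dict)   # user -> {item: rating}
    item_edges = defaultdict(dict)   # item -> {user: rating}
    for user, item, rating in ratings:
        user_edges[user][item] = rating
        item_edges[item][user] = rating

    def reidentify(background):
        """Re-identification attack sketch: an adversary who knows a few
        (item, rating) pairs for a target lists every user consistent
        with that background knowledge."""
        return [u for u, rated in user_edges.items()
                if all(rated.get(i) == r for i, r in background.items())]

    # Knowing just two of the target's ratings can already single out
    # one user, which is the kind of breach the paper aims to prevent.
    print(reidentify({"Movie A": 5, "Movie B": 3}))   # -> ['u1']

In this sketch, anonymization would have to perturb or group the edges so that any such background knowledge matches several users rather than exactly one, which is the intuition behind the privacy goals the paper formalizes.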