On the privacy of anonymized networks

Authors:
Pedram Pedarsani;Matthias Grossglauser
Affiliations:
EPFL, Lausanne, Switzerland;EPFL, Lausanne, Switzerland
Venue:
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2011

Citing 29
Cited 1

An Algorithm for Subgraph Isomorphism

Journal of the ACM (JACM)
A random graph model for massive graphs

STOC '00 Proceedings of the thirty-second annual ACM symposium on Theory of computing
On near-uniform URL sampling

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Bayesian Graph Edit Distance

IEEE Transactions on Pattern Analysis and Machine Intelligence
Structural Graph Matching Using the EM Algorithm and Singular Value Decomposition

IEEE Transactions on Pattern Analysis and Machine Intelligence - Graph Algorithms and Computer Vision
Deanonymizing Users of the SafeWeb Anonymizing Service

Proceedings of the 11th USENIX Security Symposium
A (Sub)Graph Isomorphism Algorithm for Matching Large Graphs

IEEE Transactions on Pattern Analysis and Machine Intelligence
Exact and Approximate Graph Matching Using Random Walks

IEEE Transactions on Pattern Analysis and Machine Intelligence
Graphs over time: densification laws, shrinking diameters and possible explanations

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Information revelation and privacy in online social networks

Proceedings of the 2005 ACM workshop on Privacy in the electronic society
Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography

Proceedings of the 16th international conference on World Wide Web
Impact of Human Mobility on Opportunistic Forwarding Algorithms

IEEE Transactions on Mobile Computing
Measurement and analysis of online social networks

Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
Densification arising from sampling fixed graphs

SIGMETRICS '08 Proceedings of the 2008 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Growth of the flickr social network

Proceedings of the first workshop on Online social networks
Characterizing privacy in online social networks

Proceedings of the first workshop on Online social networks
NOYB: privacy in online social networks

Proceedings of the first workshop on Online social networks
Robust De-anonymization of Large Sparse Datasets

SP '08 Proceedings of the 2008 IEEE Symposium on Security and Privacy
The structure of information pathways in a social communication network

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Link privacy in social networks

Proceedings of the 17th ACM conference on Information and knowledge management
TALE: A Tool for Approximate Large Graph Matching

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Preserving Privacy in Social Networks Against Neighborhood Attacks

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
On unbiased sampling for unstructured peer-to-peer networks

IEEE/ACM Transactions on Networking (TON)
Meme-tracking and the dynamics of the news cycle

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
De-anonymizing Social Networks

SP '09 Proceedings of the 2009 30th IEEE Symposium on Security and Privacy
Kronecker Graphs: An Approach to Modeling Networks

The Journal of Machine Learning Research
Predicting positive and negative links in online social networks

Proceedings of the 19th international conference on World wide web
A Practical Attack to De-anonymize Social Network Users

SP '10 Proceedings of the 2010 IEEE Symposium on Security and Privacy
Resisting structural re-identification in anonymized social networks

The VLDB Journal — The International Journal on Very Large Data Bases

On the performance of percolation graph matching

Proceedings of the first ACM conference on Online social networks

Quantified Score

Hi-index	0.01

Visualization

Abstract

The proliferation of online social networks, and the concomitant accumulation of user data, give rise to hotly debated issues of privacy, security, and control. One specific challenge is the sharing or public release of anonymized data without accidentally leaking personally identifiable information (PII). Unfortunately, it is often difficult to ascertain that sophisticated statistical techniques, potentially employing additional external data sources, are unable to break anonymity. In this paper, we consider an instance of this problem, where the object of interest is the structure of a social network, i.e., a graph describing users and their links. Recent work demonstrates that anonymizing node identities may not be sufficient to keep the network private: the availability of node and link data from another domain, which is correlated with the anonymized network, has been used to re-identify the anonymized nodes. This paper is about conditions under which such a de-anonymization process is possible. We attempt to shed light on the following question: can we assume that a sufficiently sparse network is inherently anonymous, in the sense that even with unlimited computational power, de-anonymization is impossible? Our approach is to introduce a random graph model for a version of the de-anonymization problem, which is parameterized by the expected node degree and a similarity parameter that controls the correlation between two graphs over the same vertex set. We find simple conditions on these parameters delineating the boundary of privacy, and show that the mean node degree need only grow slightly faster than log n with network size n for nodes to be identifiable. Our results have policy implications for sharing of anonymized network information.