On the privacy risks of publishing anonymized IP network traces

Authors:
D. Koukis;S. Antonatos;K. G. Anagnostakis
Affiliations:
Distributed Computing Systems Group, FORTH-ICS, Greece;Distributed Computing Systems Group, FORTH-ICS, Greece;Infocomm Security Department, Institute for Infocomm Research, Singapore
Venue:
CMS'06 Proceedings of the 10th IFIP TC-6 TC-11 international conference on Communications and Multimedia Security
Year:
2006

Citing 8
Cited 9

Wide-area traffic: the failure of Poisson modeling

SIGCOMM '94 Proceedings of the conference on Communications architectures, protocols and applications
The macroscopic behavior of the TCP congestion avoidance algorithm

ACM SIGCOMM Computer Communication Review
How dynamic is the Web?

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Passive estimation of TCP round-trip times

ACM SIGCOMM Computer Communication Review
Sources and Characteristics of Web Temporal Locality

MASCOTS '00 Proceedings of the 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems
Statistical Identification of Encrypted Web Browsing Traffic

SP '02 Proceedings of the 2002 IEEE Symposium on Security and Privacy
A high-level programming environment for packet trace anonymization and transformation

Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications
Fingerprinting websites using traffic analysis

PET'02 Proceedings of the 2nd international conference on Privacy enhancing technologies

On web browsing privacy in anonymized NetFlows

SS'07 Proceedings of 16th USENIX Security Symposium on USENIX Security Symposium
The risk-utility tradeoff for IP address truncation

Proceedings of the 1st ACM workshop on Network data anonymization
Reference models for network data anonymization

Proceedings of the 1st ACM workshop on Network data anonymization
A taxonomy and adversarial model for attacks against network log anonymization

Proceedings of the 2009 ACM symposium on Applied Computing
Browser Fingerprinting from Coarse Traffic Summaries: Techniques and Implications

DIMVA '09 Proceedings of the 6th International Conference on Detection of Intrusions and Malware, and Vulnerability Assessment
Website fingerprinting: attacking popular privacy enhancing technologies with the multinomial naïve-bayes classifier

Proceedings of the 2009 ACM workshop on Cloud computing security
The role of network trace anonymization under attack

ACM SIGCOMM Computer Communication Review
Short paper: the NetSANI framework for analysis and fine-tuning of network trace sanitization

Proceedings of the fourth ACM conference on Wireless network security
Analyzing characteristic host access patterns for re-identification of web user sessions

NordSec'10 Proceedings of the 15th Nordic conference on Information Security Technology for Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Networking researchers and engineers rely on network packet traces for understanding network behavior, developing models, and evaluating network performance. Although the bulk of published packet traces implement a form of address anonymization to hide sensitive information, it has been unclear if such anonymization techniques are sufficient to address the privacy concerns of users and organizations. In this paper we attempt to quantify the risks of publishing anonymized packet traces. In particular, we examine whether statistical identification techniques can be used to uncover the identities of users and their surfing activities from anonymized packet traces. Our results show that such techniques can be used by any Web server that is itself present in the packet trace and has sufficient resources to map out and keep track of the content of popular Web sites to obtain information on the network-wide browsing behavior of its clients. Furthermore, we discuss how scan sequences identified in the trace can easily reveal the mapping from anonymized to real IP addresses.