Language identification of encrypted VoIP traffic: Alejandra y Roberto or Alice and Bob?

Authors:
Charles V. Wright;Lucas Ballard;Fabian Monrose;Gerald M. Masson
Affiliations:
Department of Computer Science, Johns Hopkins University, Baltimore, MD;Department of Computer Science, Johns Hopkins University, Baltimore, MD;Department of Computer Science, Johns Hopkins University, Baltimore, MD;Department of Computer Science, Johns Hopkins University, Baltimore, MD
Venue:
SS'07: Proceedings of the 16th USENIX Security Symposium
Year:
2007

Abstract

Voice over IP (VoIP) has become a popular protocol for making phone calls over the Internet. Because sensitive conversations may transit untrusted network infrastructure, it is well understood that the contents of a VoIP session should be encrypted. However, we demonstrate that current cryptographic techniques do not provide adequate protection when the underlying audio is encoded using bandwidth-saving Variable Bit Rate (VBR) coders. Specifically, we use the lengths of encrypted VoIP packets to tackle the challenging task of identifying the language of the conversation. Our empirical analysis of 2,066 native speakers of 21 different languages shows that a substantial amount of information can be discerned from encrypted VoIP traffic. For instance, our 21-way classifier achieves 66% accuracy, almost a 14-fold improvement over random guessing (1 in 21, or about 4.8%). For 14 of the 21 languages, the accuracy is greater than 90%. We achieve an overall binary classification (e.g., "Is this a Spanish or English conversation?") rate of 86.6%. Our analysis highlights what we believe to be interesting new privacy issues in VoIP.
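
To make the idea of a packet-length side channel concrete, here is a minimal sketch. It is a toy illustration and not the classifier used in the paper: the abstract does not describe the authors' models, so the nearest-histogram rule, the function names, and the packet sizes below are all assumptions chosen for illustration. The last function only reproduces the "almost 14-fold" arithmetic from the reported 66% accuracy over 21 languages.

```python
from collections import Counter


def length_histogram(packet_lengths):
    """Return the normalized frequency of each observed packet length."""
    counts = Counter(packet_lengths)
    total = sum(counts.values())
    return {length: n / total for length, n in counts.items()}


def l1_distance(h1, h2):
    """Sum of absolute differences between two length histograms."""
    keys = set(h1) | set(h2)
    return sum(abs(h1.get(k, 0.0) - h2.get(k, 0.0)) for k in keys)


def classify(trace, models):
    """Pick the language whose histogram is closest to the trace's histogram.

    Hypothetical decision rule for illustration only.
    """
    hist = length_histogram(trace)
    return min(models, key=lambda lang: l1_distance(hist, models[lang]))


def improvement_over_chance(accuracy, num_classes):
    """Ratio of classifier accuracy to uniform random guessing."""
    return accuracy / (1.0 / num_classes)


# Invented packet lengths (bytes); real VBR codecs and languages will differ.
training = {
    "english": [20, 28, 38, 38, 46, 62, 62, 62],
    "spanish": [20, 20, 28, 28, 38, 46, 46, 62],
}
models = {lang: length_histogram(t) for lang, t in training.items()}

# Lengths of an "encrypted" call: payloads are hidden, sizes are not.
unknown = [62, 62, 38, 46, 62, 28]

print(classify(unknown, models))
print(round(improvement_over_chance(0.66, 21), 2))
```

The sketch never inspects packet contents. It relies only on observable sizes, which encryption of a VBR-coded stream leaves visible.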