Audio codec identification from coded and transcoded audios

Authors:
Samet Hicsonmez;Husrev T. Sencar;Ismail Avcibas
Affiliations:
Computer Engineering Department, TOBB Economics and Technology University, Ankara, Turkey;Computer Engineering Department, TOBB Economics and Technology University, Ankara, Turkey;Electrical and Electronics Engineering Department, Turgut Ozal University, Ankara, Turkey
Venue:
Digital Signal Processing
Year:
2013

Abstract

A novel technique is presented for identifying the codec used to produce a coded audio. The technique does not decode the audio, use any coding metadata, or assume knowledge of the structure of a codec's bit stream format. The underlying idea is that the design choices governing a codec's compression level, audio quality, and complexity leave detectable traces in the coded audio. To exploit this, the technique samples 2-4 kilobytes of data from a coded audio and analyzes the randomness and chaotic nature of the sampled data to build statistical models that represent the encoding process associated with each codec. In experiments, we use 16 of the most popular audio codecs employed for high-quality audio compression and in PSTNs, cellular networks, and VoIP networks, with the encoding parameters of each codec set to their most commonly used values. The results show that the codec of an encoded audio can be identified with an accuracy of more than 95 percent. Further experiments covering several transcoding scenarios show that the scheme can also identify the first encoder of a doubly encoded audio with an accuracy of roughly 80 to 90 percent or higher, provided that the second codec operates at a higher bit rate than the first.
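
The payload-sampling idea can be illustrated with a minimal Python sketch. It is not the authors' feature set, which the abstract describes only as measures of randomness and chaotic nature. The sketch assumes two simple stand-in randomness features (byte-histogram entropy and the fraction of 1-bits) and a 2048-byte contiguous sample taken at a random offset. The function names `sample_payload`, `byte_entropy`, `bit_balance`, and `features` are hypothetical.

```python
import math
import random
from collections import Counter


def sample_payload(data: bytes, size: int = 2048, rng=None) -> bytes:
    """Return one contiguous block of `size` bytes from a random offset.

    No decoding or header parsing is done; the bytes are taken as-is.
    The 2048-byte default is an assumption within the 2-4 KB range.
    """
    rng = rng or random.Random()
    if len(data) <= size:
        return data
    start = rng.randrange(len(data) - size + 1)
    return data[start:start + size]


def byte_entropy(block: bytes) -> float:
    """Shannon entropy of the byte histogram, in bits per byte (0 to 8)."""
    if not block:
        return 0.0
    n = len(block)
    return -sum((c / n) * math.log2(c / n) for c in Counter(block).values())


def bit_balance(block: bytes) -> float:
    """Fraction of bits that are 1; close to 0.5 for random-looking data."""
    if not block:
        return 0.0
    ones = sum(bin(b).count("1") for b in block)
    return ones / (8 * len(block))


def features(block: bytes) -> list:
    """Stand-in feature vector (an assumption, not the paper's features)."""
    return [byte_entropy(block), bit_balance(block)]
```

Feature vectors of this kind, computed on samples from files with known codecs, could then be used to train a classifier. Which features and which classifier give the reported accuracies is not stated in the abstract.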