Re-encoding of perceptually quantized wavelet packet transform coefficients of audio and high quality speech

  • Authors:
  • Omid Ghahabi;Mohammad H. Savoji

  • Affiliations:
  • Department of Electrical and Computer Engineering, Shahid Beheshti University, Tehran, Iran;Department of Electrical and Computer Engineering, Shahid Beheshti University, Tehran, Iran

  • Venue:
  • DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper reports on the results of four reencoding schemes on perceptually quantized wavelet packet transform (WPT) coefficients of audio and high quality speech. These schemes comprises: 1- Embedded Zero-tree Wavelet (EZW) 2- The set partitioning in hierarchical trees (SPIHT) 3- JPEG-based entropy/run length Huffman and 4- JPEG-type Audio Huffman coding algorithms. Since EZW and SPIHT are designed for image compression, some new modifications have been implemented in these schemes for their better matching with audio signals. The performances of these four re-encoders are compared in terms of average output bit rate and computation time of a same codec. It is concluded that the JPEG-type Audio Huffman coding achieves the best results although it is not possible to truncate the bit stream, in this case, to easily match the bit rate to the fixed channel capacity.