Re-encoding of perceptually quantized wavelet packet transform coefficients of audio and high quality speech

Authors:
Omid Ghahabi;Mohammad H. Savoji
Affiliations:
Department of Electrical and Computer Engineering, Shahid Beheshti University, Tehran, Iran;Department of Electrical and Computer Engineering, Shahid Beheshti University, Tehran, Iran
Venue:
DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
Year:
2009

Citing 6
Cited 0

Modified SPIHT algorithm for wavelet packet image coding

Real-Time Imaging - Special issue on multi-dimensional image processing
Analysis of low bit rate image transform coding

IEEE Transactions on Signal Processing
Smooth wavelets, transform coding, and Markov-1 processes

IEEE Transactions on Signal Processing
Quantifying the Coding Performance of Zerotrees of Wavelet Coefficients: Degree-k Zerotree

IEEE Transactions on Signal Processing
Adaptive wavelet packet basis selection for zerotree image coding

IEEE Transactions on Image Processing
A new, fast, and efficient image codec based on set partitioning in hierarchical trees

IEEE Transactions on Circuits and Systems for Video Technology

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper reports on the results of four reencoding schemes on perceptually quantized wavelet packet transform (WPT) coefficients of audio and high quality speech. These schemes comprises: 1- Embedded Zero-tree Wavelet (EZW) 2- The set partitioning in hierarchical trees (SPIHT) 3- JPEG-based entropy/run length Huffman and 4- JPEG-type Audio Huffman coding algorithms. Since EZW and SPIHT are designed for image compression, some new modifications have been implemented in these schemes for their better matching with audio signals. The performances of these four re-encoders are compared in terms of average output bit rate and computation time of a same codec. It is concluded that the JPEG-type Audio Huffman coding achieves the best results although it is not possible to truncate the bit stream, in this case, to easily match the bit rate to the fixed channel capacity.