Speech compression with masked modulated lapped transform and SPIHT algorithm

Authors:
Maitreyee Dutta;Renu Vig
Affiliations:
Computer Science Department, National Institute of Technical Teachers' Training and Research, Chandigarh, India;Department of Information Technology, Panjab University, Chandigarh, India
Venue:
ISCGAV'05 Proceedings of the 5th WSEAS International Conference on Signal Processing, Computational Geometry & Artificial Vision
Year:
2005

Citing 2
Cited 0

A new, fast, and efficient image codec based on set partitioning in hierarchical trees

IEEE Transactions on Circuits and Systems for Video Technology
Image resizing in the compressed domain using subband DCT

IEEE Transactions on Circuits and Systems for Video Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

A new method of speech compression using the Modulated Lapped Transform(MLT) and Set Partitioning In Hierarchical Trees (SPIHT) algorithm is proposed in this paper. The improvement in the signal with the application of masking with Psychoacoustic model has also been discussed. The proposed scheme is based on the combination of the Modulated Lapped Transform(MLT) and SPIHT. This paper also describes an excitation level based psychoacoustic model to estimate the simultaneous masking threshold for speech coding. The system has the following stages. 1) a windowing function; 2) a time-to-frequency transformation; 3) an excitation level calculation block 4) a correction factor for estimating masking threshold; 5) the inclusion of the absolute masking threshold; 6) the output Signal-to-Masking ratio. We evaluated the performance by integrating the psychoacoustic model into speech coding. Comparisons are also made with Plain LPC Coder, Voice Excited LPC Coder with the coding of the residual signal with DCT, Voice Excited LPC Coder with the coding of the residual signal with MLT, Voice Excited LPC Coder with the coding of the residual signal with MLT and SPIHT. The performance of the coders described has been assessed by computer simulation in terms of a) Signal -to -noise ratio (SNR) b) Compression ratio c) Percent Root mean square Difference(PRD). d) Informal subjective listening test.