Scalable audio coding using the nonuniform modulated complex lapped transform

  • Authors:
  • A. S. Scheuble;Zixiang Xiong

  • Affiliations:
  • Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA;-

  • Venue:
  • ICASSP '01: Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 05
  • Year:
  • 2001

Abstract

This paper introduces a scalable audio coder based on the nonuniform modulated complex lapped transform (NMCLT), a new nonuniform oversampled filter bank that offers a better combination of time- and frequency-domain localization than previous designs. Masking functions for the different critical Bark bands are first calculated directly from the NMCLT coefficients and used as perceptual weights; arithmetic coding is then applied to the bit planes of the weighted NMCLT coefficients to generate a perceptually scalable audio bitstream. The loss in coding performance due to oversampling is offset by limiting the amount of redundancy in the transform and by exploiting the correlations among the NMCLT basis functions. Experiments show that the new coder outperforms a coder based on the modulated lapped transform (MLT) both objectively and subjectively.
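
The bit-plane idea behind the scalable bitstream can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `to_bit_planes` and `from_bit_planes`, the fixed-point scaling, and the example weights are all assumptions made for illustration. The NMCLT itself, the masking computation, and the arithmetic coder are omitted. The sketch only shows how perceptually weighted coefficients can be split into planes from most to least significant bit, so that decoding a prefix of the planes gives a coarser reconstruction.

```python
def to_bit_planes(coeffs, weights, num_planes=8):
    """Split weighted coefficients into sign flags and magnitude bit planes.

    Hypothetical helper: `weights` stands in for perceptual weights that a
    real coder would derive from masking functions.
    Planes are returned most significant bit first.
    """
    weighted = [c * w for c, w in zip(coeffs, weights)]
    peak = max((abs(v) for v in weighted), default=0.0) or 1.0
    max_level = (1 << num_planes) - 1
    scale = max_level / peak  # assumed fixed-point scaling
    mags = [min(round(abs(v) * scale), max_level) for v in weighted]
    signs = [v < 0 for v in weighted]
    planes = [[(m >> b) & 1 for m in mags]
              for b in range(num_planes - 1, -1, -1)]
    return signs, planes, scale


def from_bit_planes(signs, planes, weights, scale, planes_used):
    """Reconstruct coefficients from the first `planes_used` bit planes."""
    num_planes = len(planes)
    mags = [0] * len(signs)
    for i, plane in enumerate(planes[:planes_used]):
        shift = num_planes - 1 - i  # bit position of this plane
        mags = [m | (bit << shift) for m, bit in zip(mags, plane)]
    # Undo the scaling and the perceptual weighting.
    return [(-m if s else m) / scale / w
            for m, s, w in zip(mags, signs, weights)]


def max_abs_error(original, reconstructed):
    """Largest absolute reconstruction error over all coefficients."""
    return max(abs(a - b) for a, b in zip(original, reconstructed))
```

Calling `from_bit_planes` with a smaller `planes_used` mimics truncating the bitstream. In this sketch the reconstruction error does not increase as more planes are decoded, which is the property a scalable coder relies on.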