Spatial parameters for audio coding: MDCT domain analysis and synthesis

Authors:
Shuixian Chen;Naixue Xiong;Jong Hyuk Park;Min Chen;Ruimin Hu
Affiliations:
Computer School, Wuhan University, Wuhan, China;Department of Computer Science, Georgia State University, Atlanta, USA;Department of Computer Science and Engineering, Kyungnam University, Masan, Korea;School of Computer Science & Engineering, Seoul National University, Seoul, Korea 151-744;Computer School, Wuhan University, Wuhan, China
Venue:
Multimedia Tools and Applications
Year:
2010

Abstract

We use the Modified Discrete Cosine Transform (MDCT) to analyze and synthesize spatial parameters. By itself, the MDCT lacks phase information and does not conserve energy, both of which are needed to represent spatial parameters. Complementing the MDCT with the Modified Discrete Sine Transform (MDST) to form the complex spectrum "MDCT-j*MDST" overcomes this limitation and allows the parameters to be represented in a form similar to that of the DFT. Because of overlap-add in the time domain, an MDST spectrum can be reconstructed exactly from the MDCT spectra of neighboring frames through matrix-vector multiplication. The matrix is strongly concentrated around its main diagonal, so keeping only a small number of its sub-diagonals is sufficient to approximate it. When an MDCT-based core coder such as Advanced Audio Coding (AAC) is used in spatial audio coding, no separate transform is needed for spatial processing, which significantly reduces the computational complexity. Subjective listening tests also show that MDCT-domain spatial processing causes no quality impairment.
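
The construction described in the abstract can be illustrated with a small numerical sketch. It is not the authors' implementation. It assumes a sine window, unnormalized cosine and sine bases, and a 2/N synthesis scaling, and every function name is hypothetical. The level and phase differences computed by spatial_params are common parametric-stereo definitions chosen for illustration; the abstract does not say which spatial parameters the paper uses.

```python
import numpy as np

def bases(N):
    """Unnormalized MDCT (cosine) and MDST (sine) bases, each of shape (N, 2N)."""
    n = np.arange(2 * N) + 0.5 + N / 2.0      # shifted time index
    k = np.arange(N)[:, None] + 0.5           # half-integer frequency index
    arg = np.pi / N * k * n
    return np.cos(arg), np.sin(arg)

def sine_window(N):
    """Sine window of length 2N (an assumption; it satisfies Princen-Bradley)."""
    return np.sin(np.pi * (np.arange(2 * N) + 0.5) / (2 * N))

def mdst_from_mdct_matrices(N):
    """Matrices mapping the MDCT spectra of frames t-1, t, t+1 to the MDST of frame t."""
    C, S = bases(N)
    w = sine_window(N)
    Sw = S * w                      # MDST analysis of a windowed frame
    G = (2.0 / N) * (C * w).T       # windowed inverse MDCT, shape (2N, N)
    A_prev = Sw[:, :N] @ G[N:, :]   # tail of frame t-1 overlaps head of frame t
    A_cur = Sw @ G                  # contribution of frame t itself
    A_next = Sw[:, N:] @ G[:N, :]   # head of frame t+1 overlaps tail of frame t
    return A_prev, A_cur, A_next

def keep_band(A, width):
    """Zero every entry farther than `width` from the main diagonal."""
    i, j = np.indices(A.shape)
    return np.where(np.abs(i - j) <= width, A, 0.0)

def spatial_params(ZL, ZR):
    """Level difference (dB) and phase difference (rad) of two complex spectra."""
    ild_db = 10.0 * np.log10(np.sum(np.abs(ZL) ** 2) / np.sum(np.abs(ZR) ** 2))
    ipd = np.angle(np.sum(ZL * np.conj(ZR)))
    return ild_db, ipd

# Demo on random data: three overlapping frames with hop size N.
N = 16
rng = np.random.default_rng(0)
x = rng.standard_normal(4 * N)
C, S = bases(N)
w = sine_window(N)
X = [C @ (w * x[t * N:(t + 2) * N]) for t in range(3)]   # MDCT of frames 0, 1, 2
A = mdst_from_mdct_matrices(N)

Y_ref = S @ (w * x[N:3 * N])                     # MDST of frame 1, computed directly
Y_est = sum(Ai @ Xi for Ai, Xi in zip(A, X))     # MDST of frame 1 from MDCT spectra only
print("max error, full matrices:", np.max(np.abs(Y_est - Y_ref)))

# Effect of keeping only a few sub-diagonals (values printed, not asserted).
for width in (1, 3, 7):
    Y_band = sum(keep_band(Ai, width) @ Xi for Ai, Xi in zip(A, X))
    rel = np.linalg.norm(Y_band - Y_ref) / np.linalg.norm(Y_ref)
    print("band half-width", width, "relative error", rel)

Z = X[1] - 1j * Y_est   # DFT-like complex spectrum "MDCT-j*MDST" of frame 1
```

With the full matrices, the MDST estimate should match the directly computed MDST up to rounding error. The banded variant trades accuracy for fewer multiplications. Calling spatial_params on the complex spectra of two channels gives a level and a phase difference that the MDCT coefficients alone cannot provide, since each real coefficient depends on the phase of the underlying component.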