Perceptual and objective detection of discontinuities in concatenative speech synthesis
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Hi-index | 0.00 |
In [1,2,3], Binaural Cue Coding (BCC) was introduced for multi-channel spatial rendering for MPEG 4 – SAC (Spatial Audio Coding) to reduce bitrate of multi-channel audio signal. In [4,5], Virtual Source Location Information (VSLI) was introduced to replace Inter-Channel Level Difference, the most determinant parameter in BCC system. Here, Variable Bit Quantization (VBQ) for VSLI is proposed to reduce bitrate at the quantization block in VSLI-based BCC systems removing statistically invalid range.