An efficient transcoding algorithm between AMR-NB and G.729ab

Authors:
Changchun Bao;Hao Xu;Bingyin Xia;Zhangyu Liu;Jianwei Qiu
Affiliations:
Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China;Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China;Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China;Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China;Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China
Venue:
Speech Communication
Year:
2010

Citing 4
Cited 0

Performance assessment of tandem connection of enhanced cellular coders

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Efficient fixed codebook search method for ACELP speech codecs

ICHIT'06 Proceedings of the 1st international conference on Advances in hybrid information technology
Improving the transcoding capability of speech coders

IEEE Transactions on Multimedia
ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications

IEEE Communications Magazine

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, an efficient transcoding algorithm between AMR-NB and G.729ab is proposed. The proposed algorithm further elaborates on solutions when a DTX function is adopted between the source and destination coding systems. When neither, either or both of the source and destination coding systems adopt the DTX function, the proposed algorithm can carry out the transcoding operation between the two coding systems efficiently. When neither of the two coding systems adopts the DTX function, transcoding methods in different domains are proposed. A scalable distortion measure method based on parameter domain, specifically related to codebook gain conversion, is proposed to keep the amplitude of synthesized speech. The effect on subjective speech quality due to the amplitude of synthesized speech is cancelled out by using the proposed method and the computational complexity is reduced as well. When either or both of the two coding systems adopt the DTX function, depending on the type of the destination frame, transcoding methods between speech frames and non-speech frames are proposed. When the frame is declared as an erased frame, a linear prediction-based pitch recovery and transcoding method is used in this paper. By employing the proposed algorithm in transcoders, complexity is reduced by about 26-82% and quality is also improved compared to the conventional DTE method.