Voice conversion using linear prediction coefficients and artificial neural network

Authors:
Santosh Kumar Bharti;Shashidhar G. Koolagudi;K. Sreenivasa Rao;Ankur Choudhary;Binod Kumar
Affiliations:
Graphic Era University, Dehradun, Uttrakhand, India;Graphic Era University, Dehradun, Uttrakhand, India;Indian Institute of Technology, Kharagpur, West Bengal, India;Graphic Era University, Dehradun, Uttrakhand, India;Indian Institute of Technology, Kharagpur, West Bengal, India
Venue:
Proceedings of the CUBE International Information Technology Conference
Year:
2012

Citing 9
Cited 0

Acoustic characteristics of speaker individuality: control and conversion

Speech Communication - Special issue: voice conversion: state of the art and perspectives
Transformation of formants for voice conversion using artificial neural networks

Speech Communication - Special issue: voice conversion: state of the art and perspectives
Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Voice conversion using Artificial Neural Networks

ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Voice conversion by mapping the speaker-specific features using pitch synchronous approach

Computer Speech and Language
Voice conversion using partial least squares regression

IEEE Transactions on Audio, Speech, and Language Processing
Voice conversion based on weighted frequency warping

IEEE Transactions on Audio, Speech, and Language Processing
Spectral mapping using artificial neural networks for voice conversion

IEEE Transactions on Audio, Speech, and Language Processing
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes the implementation of voice conversion system using linear prediction coefficients (LPC) and artificial neural networks (ANN). In this work, microphone recorded parallel speech utterances are used for voice conversion. Linear prediction coefficients are extracted from the speech utterance of the source and target speakers using linear prediction analysis. Auto associative neural networks (AANN) are configured and used for the transformation of LP coefficients from the source speaker to target speaker. LP residual obtained from target speaker and transformed LP coefficients are used to reconstruct the transformed speech. Mean opinion score and student t-test tests are conducted to evaluate the performance of voice conversion system. Evaluation of voice conversion system is performed in all 4 cases; such as male-to-male, male-to-female, female-to male and female-to-female. The test results show that the quality of transformed speech is good and AANN's have properly transformed LP coefficients of the source to that target speaker.