Acoustic characteristics of speaker individuality: control and conversion
Speech Communication - Special issue: voice conversion: state of the art and perspectives
Transformation of formants for voice conversion using artificial neural networks
Speech Communication - Special issue: voice conversion: state of the art and perspectives
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Voice conversion using Artificial Neural Networks
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Voice conversion by mapping the speaker-specific features using pitch synchronous approach
Computer Speech and Language
Voice conversion using partial least squares regression
IEEE Transactions on Audio, Speech, and Language Processing
Voice conversion based on weighted frequency warping
IEEE Transactions on Audio, Speech, and Language Processing
Spectral mapping using artificial neural networks for voice conversion
IEEE Transactions on Audio, Speech, and Language Processing
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
This paper describes the implementation of voice conversion system using linear prediction coefficients (LPC) and artificial neural networks (ANN). In this work, microphone recorded parallel speech utterances are used for voice conversion. Linear prediction coefficients are extracted from the speech utterance of the source and target speakers using linear prediction analysis. Auto associative neural networks (AANN) are configured and used for the transformation of LP coefficients from the source speaker to target speaker. LP residual obtained from target speaker and transformed LP coefficients are used to reconstruct the transformed speech. Mean opinion score and student t-test tests are conducted to evaluate the performance of voice conversion system. Evaluation of voice conversion system is performed in all 4 cases; such as male-to-male, male-to-female, female-to male and female-to-female. The test results show that the quality of transformed speech is good and AANN's have properly transformed LP coefficients of the source to that target speaker.