Voice conversion based on probabilistic parameter transformation and extended inter-speaker residual prediction

Authors:
Zdeněk Hanzlíček;Jindřich Matoušek
Affiliations:
University of West Bohemia, Faculty of Applied Sciences, Department of Cybernetics, Plzeň, Czech Republic;University of West Bohemia, Faculty of Applied Sciences, Department of Cybernetics, Plzeň, Czech Republic
Venue:
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Year:
2007

Citing 3
Cited 1

High-resolution voice transformation

High-resolution voice transformation
Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
First steps towards new czech voice conversion system

TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue

First Experiments on Text-to-Speech System Personification

TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue

Quantified Score

Hi-index	0.00

Visualization

Abstract

Voice conversion is a process which modifies speech produced by one speaker so that it sounds as if it is uttered by another speaker. In this paper a new voice conversion system is presented. The system requires parallel training data. By using linear prediction analysis, speech is described with line spectral frequencies and the corresponding residua. LSFs are converted together with instantaneous F0 by joint probabilistic function. The residua are transformed by employing residual prediction. In this paper, a new modification of residual prediction is introduced which uses information on the desired target F0 to determine a proper residuum and it also allows an efficient control of F0 in resulting speech.