A segment-based approach to voice conversion

Authors:
M. Abe
Affiliations:
ATR Interpreting Telephony Research Laboratories, Kyoto, Japan
Venue:
ICASSP '91: Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing
Year:
1991

Citing 0
Cited 4

Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2

Voice transformation using PSOLA technique
ICASSP '92 Proceedings of the 1992 IEEE International Conference on Acoustics, Speech and Signal Processing - Volume 1

Speaker-independent HMM-based voice conversion using adaptive quantization of the fundamental frequency
Speech Communication

Spoken dialogue in virtual worlds
COST '09 Proceedings of the Second International Conference on Development of Multimodal Interfaces: Active Listening and Synchrony

Abstract

A voice conversion algorithm that uses speech segments as conversion units is proposed. Input speech is decomposed into speech segments by a speech recognition module, and the segments are replaced by speech segments uttered by another speaker. This algorithm makes it possible to convert not only the static characteristics but also the dynamic characteristics of speaker individuality. The proposed voice conversion algorithm was used with two male speakers. Spectrum distortion between target speech and the converted speech was reduced to one-third the natural spectrum distortion between the two speakers. A listening experiment showed that, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame.
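
The segment-replacement idea described in the abstract can be illustrated with a small toy sketch. It is not the paper's implementation and rests on several assumptions the abstract does not state: the recognition module's output is taken as given in the form of labelled segments, the function names (`resample`, `convert`, `mean_distortion`) are hypothetical, durations are matched by crude nearest-index sampling, unseen labels fall back to the source frames, and distortion is measured as a plain mean Euclidean distance between already-aligned frames.

```python
import math
from typing import Dict, List, Tuple

Frame = List[float]                 # one spectral feature vector
Segment = Tuple[str, List[Frame]]   # (unit label, frames of that unit)


def resample(frames: List[Frame], n: int) -> List[Frame]:
    """Pick n frames by nearest-index sampling (assumed duration matching)."""
    if n <= 0 or not frames:
        return []
    if n == 1:
        return [frames[0]]
    last = len(frames) - 1
    return [frames[round(i * last / (n - 1))] for i in range(n)]


def convert(source: List[Segment],
            inventory: Dict[str, List[Frame]]) -> List[Frame]:
    """Replace each labelled source segment with the target speaker's segment.

    `inventory` maps a unit label to one stored segment of the target
    speaker (hypothetical layout; one exemplar per label for simplicity).
    """
    out: List[Frame] = []
    for label, src_frames in source:
        tgt_frames = inventory.get(label)
        if tgt_frames is None:
            # Assumption: keep the source frames when no target unit exists.
            out.extend(src_frames)
        else:
            # Stretch or shrink the target segment to the source duration.
            out.extend(resample(tgt_frames, len(src_frames)))
    return out


def mean_distortion(a: List[Frame], b: List[Frame]) -> float:
    """Mean Euclidean distance between two equal-length, aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    total = 0.0
    for fa, fb in zip(a, b):
        total += math.sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))
    return total / len(a)


if __name__ == "__main__":
    # Made-up one-dimensional "spectra", for illustration only.
    source = [("a", [[0.0], [0.0], [0.0]]), ("k", [[1.0]])]
    inventory = {"a": [[5.0], [6.0]]}
    print(convert(source, inventory))
```

Because whole segments are swapped rather than individual frames, the trajectory inside each target segment is carried over intact, which is one way to read the abstract's claim about converting dynamic as well as static characteristics. A distortion ratio like the one reported would be obtained by comparing `mean_distortion` for converted versus target speech against the same measure for source versus target speech, after time alignment.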