Voice conversion algorithm

Authors:
Akash I. Mecwan;Vijay G. Savani;Shah Rajvi;Priya Vaya
Affiliations:
Nirma University, Ahmedabad, Gujarat, India;Nirma University, Ahmedabad, Gujarat, India;Nirma University, Ahmedabad, Gujarat, India;Nirma University, Ahmedabad, Gujarat, India
Venue:
Proceedings of the International Conference on Advances in Computing, Communication and Control
Year:
2009

Abstract

Considerable work has recently been done in speech technology. Because the main focus has been on text-to-speech and automatic speech recognition techniques, voice conversion remains an underdeveloped and nascent field in which substantial contributions from speech researchers are expected. This paper discusses an approach to static voice conversion. Static speech parameters are those over which the speaker has the least control, such as vocal tract structure and the natural pitch of speech. Two main parameters are considered here: vocal tract structure and pitch. For the conversion process, speech is resolved into two components, an excitation component and a filter component, using a Linear Predictive Coding (LPC) based source-filter model. The pitch contour is determined by autocorrelation. The excitation component is produced by a set of signal generators that generate the determined pitch and are driven by voicing detection. For voiced segments, the filter coefficients are modified to approach the target speaker's coefficients; for unvoiced segments, the source speaker's filter coefficients are used directly.
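
As a rough illustration of the autocorrelation-based pitch determination and voicing detection mentioned in the abstract, the following Python sketch estimates the pitch of a single speech frame. It is not the authors' implementation: the function names, the 50 to 400 Hz search range, and the 0.3 voicing threshold are all assumptions chosen for illustration, and the abstract does not specify how the paper makes the voicing decision.

```python
import math


def autocorrelation(frame, lag):
    """Biased autocorrelation of `frame` at a single non-negative lag."""
    return sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))


def estimate_pitch(frame, sample_rate, f_min=50.0, f_max=400.0,
                   voicing_threshold=0.3):
    """Return (voiced, f0_hz) for one frame; f0_hz is 0.0 when unvoiced.

    f_min, f_max and voicing_threshold are illustrative assumptions,
    not values taken from the paper.
    """
    energy = autocorrelation(frame, 0)
    if energy == 0.0:
        return False, 0.0  # silent frame: nothing to analyse

    # Convert the assumed pitch range into a range of candidate lags.
    min_lag = max(1, int(sample_rate / f_max))
    max_lag = min(int(sample_rate / f_min), len(frame) - 1)
    if max_lag < min_lag:
        return False, 0.0  # frame too short for the requested range

    # The lag with the largest autocorrelation is taken as the pitch period.
    best_lag = max(range(min_lag, max_lag + 1),
                   key=lambda lag: autocorrelation(frame, lag))

    # Assumed voicing rule: a weak peak relative to the frame energy
    # is treated as an unvoiced frame.
    strength = autocorrelation(frame, best_lag) / energy
    if strength < voicing_threshold:
        return False, 0.0
    return True, sample_rate / best_lag


if __name__ == "__main__":
    fs = 8000  # assumed sampling rate in Hz
    tone = [math.sin(2 * math.pi * 200 * n / fs) for n in range(400)]
    print(estimate_pitch(tone, fs))         # periodic test frame
    print(estimate_pitch([0.0] * 400, fs))  # silent test frame
```

In a conversion pipeline of the kind the abstract describes, a per-frame decision like this would select between regenerating the excitation at the determined pitch (voiced) and passing the source filter coefficients through unchanged (unvoiced).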