Transformation of Speaker Characteristics in Speech Using Support Vector Machines

  • Authors:
  • K. Sreenivasa Rao;Shashidhar G. Koolagudi

  • Affiliations:
  • -;-

  • Venue:
  • ADCOM '07 Proceedings of the 15th International Conference on Advanced Computing and Communications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose Support Vector Machines (SVM) for transforming the speaker characteristics of the speech. Speaker characteristics are mainly influenced by the behavioural characteristics (prosody) of the speaker, characteristics of the vocal tract system and the excitation source. In this work speaker transformation indicates, modifying the speaker char- acteristics of the speech according to the desired speaker, and preserving the underlying message (sequence of sound units, i.e., text) same as in the original speech. This is per- formed by deriving the mapping functions for transforming the vocal tract characteristics and prosodic characteristics. SVMs are explored for deriving these mapping functions. The prosodic parameters and the characteristics of the vocal tract system and the excitation source of the target speaker are obtained from the output of the mapping functions. The manipulations of the prosodic parameters (durational char- acteristics, pitch contour (intonation pattern) and intensity patterns) are achieved by manipulating the Linear Predic- tion (LP) residual with the help of the knowledge of the in- stants of significant excitation. The modified LP residual is used to excite the time varying filter. The filter parameters are updated according to the desired vocal tract characteris- tics. The target speaker's speech is synthesized and evalu- ated using listening tests. The results of the listening tests indicate that the proposed mapping functions using SVMs provide the better speaker transformation compared to the earlier methods proposed by the author [1].