Optimal weight tuning method for unit selection cost functions in syllable based text-to-speech synthesis

  • Authors:
  • N. P. Narendra; K. Sreenivasa Rao

  • Affiliations:
  • School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India (both authors)

  • Venue:
  • Applied Soft Computing
  • Year:
  • 2013

Abstract

This paper proposes a method for tuning the weights of unit selection cost functions in a syllable-based text-to-speech (TTS) synthesis system. In this work, the unit selection cost functions, namely the target cost and the concatenation cost, are designed specifically for syllable units. The method tunes the weights so that perceptual preference patterns are taken into account when units are selected. A genetic algorithm is used to derive the optimal weights, with a fitness function designed to map perceptual preference patterns onto the weights of the unit selection cost functions. The effectiveness of the proposed method is evaluated using both subjective and objective measures. The results show that the derived optimal weights synthesize better-quality speech than manually tuned weights.
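
The idea of deriving cost weights with a genetic algorithm can be illustrated with a minimal sketch. The abstract does not specify the paper's fitness function, encoding, or genetic operators, so everything below is an assumption made for illustration: the fitness is taken to be the fraction of listener preference pairs in which the preferred candidate unit receives the lower weighted cost, and the names `total_cost`, `fitness`, `normalize`, and `tune_weights` are hypothetical.

```python
import random

def total_cost(weights, sub_costs):
    """Weighted sum of sub-costs (e.g. target and concatenation sub-costs)."""
    return sum(w * c for w, c in zip(weights, sub_costs))

def fitness(weights, preference_pairs):
    """Assumed fitness: fraction of (preferred, rejected) pairs in which the
    perceptually preferred candidate gets the strictly lower weighted cost."""
    agree = sum(
        1
        for preferred, rejected in preference_pairs
        if total_cost(weights, preferred) < total_cost(weights, rejected)
    )
    return agree / len(preference_pairs)

def normalize(weights):
    """Scale positive weights so that they sum to 1."""
    s = sum(weights)
    return [w / s for w in weights]

def tune_weights(preference_pairs, n_weights, pop_size=30, generations=40,
                 mutation_rate=0.2, seed=0):
    """Toy genetic algorithm: truncation selection, arithmetic crossover,
    Gaussian mutation of one gene. Operator choices are illustrative only."""
    rng = random.Random(seed)
    population = [
        normalize([rng.random() + 1e-9 for _ in range(n_weights)])
        for _ in range(pop_size)
    ]
    for _ in range(generations):
        ranked = sorted(population,
                        key=lambda w: fitness(w, preference_pairs),
                        reverse=True)
        parents = ranked[: pop_size // 2]  # keep the fitter half unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [(x + y) / 2 for x, y in zip(a, b)]  # arithmetic crossover
            if rng.random() < mutation_rate:
                i = rng.randrange(n_weights)
                # perturb one weight, keeping it strictly positive
                child[i] = max(1e-9, child[i] + rng.gauss(0, 0.1))
            children.append(normalize(child))
        population = parents + children
    return max(population, key=lambda w: fitness(w, preference_pairs))
```

Under these assumptions, each preference pair holds the sub-cost vectors of two candidate units for the same target, with the listener-preferred one first. Calling `tune_weights(pairs, n_weights=2)` returns a normalized weight vector, which can then be scored with `fitness` or compared against a manually chosen vector on the same pairs.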