Extracting user preferences by GTM for aiGA weight tuning in unit selection text-to-speech synthesis

  • Authors:
  • Lluís Formiga;Francesc Alías

  • Affiliations:
  • Department of Communications and Signal Theory, Enginyeria i Arquitectura La Salle, Universitat Ramon Llull, Barcelona, Spain;Department of Communications and Signal Theory, Enginyeria i Arquitectura La Salle, Universitat Ramon Llull, Barcelona, Spain

  • Venue:
  • IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Unit-selection based Text-to-Speech synthesis systems aim to obtain high quality synthetic speech by optimally selecting previously recorded units. To that effect these units are selected by a dynamic programming algorithm guided through a weighted cost function. Thus, in this context, weights should be tuned perceptually so as to be in agreement with perception from listening users. In previous works we have proposed to subjectively tune these weights through an interactive evolutionary process, also known as Active Interactive Genetic Algorithm (aiGA). The problem comes out when different users, although being consistent, evolve to different weight configurations. In this proof-of-principle work, Generative Topographic Mapping (GTM) is introduced as a method to extract knowledge from user specific preferences. The experiments show that GTM is able to capture user preferences, thus, avoiding selecting the best evolved weight configuration by means of a second preference test.