New rule-based and data-driven strategy to incorporate Fujisaki's F/sub 0/ model to a text-to-speech system in Castillian Spanish

  • Authors:
  • J. M. Gutierrez-Arriola;J. M. Montero;D. Saiz;J. M. Pardo

  • Affiliations:
  • Dept. de Ingenieria Electron., Univ. Politecnica de Madrid, Spain;-;-;-

  • Venue:
  • ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present the analysis of a Spanish prosody database by estimating the parameters of Fujisaki's (1981) model for F/sub 0/ contours. These parameters are classified attending to linguistic features and they form the analysis database. When synthesizing F/sub 0/ contours we extract the linguistic features from the text and perform a k-nearest neighbour search. Linguistic feature comparison distance is trained using data from the prosody database. To avoid artifacts we perform a rule-base filtering on synthesis parameters. The results of our evaluation test show that the proposed system is significantly better than the previous neural network approach. This evaluation confirms the ability of Fujisaki's model to represent prosody information based on linguistic features.