New rule-based and data-driven strategy to incorporate Fujisaki's F/sub 0/ model to a text-to-speech system in Castillian Spanish

Authors:
J. M. Gutierrez-Arriola;J. M. Montero;D. Saiz;J. M. Pardo
Affiliations:
Dept. de Ingenieria Electron., Univ. Politecnica de Madrid, Spain;-;-;-
Venue:
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Year:
2001

Citing 0
Cited 1

Applying data mining techniques to corpus based prosodic modeling

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present the analysis of a Spanish prosody database by estimating the parameters of Fujisaki's (1981) model for F/sub 0/ contours. These parameters are classified attending to linguistic features and they form the analysis database. When synthesizing F/sub 0/ contours we extract the linguistic features from the text and perform a k-nearest neighbour search. Linguistic feature comparison distance is trained using data from the prosody database. To avoid artifacts we perform a rule-base filtering on synthesis parameters. The results of our evaluation test show that the proposed system is significantly better than the previous neural network approach. This evaluation confirms the ability of Fujisaki's model to represent prosody information based on linguistic features.