Czech Pitch Contour Modeling Using Linear Prediction

Authors:
Petr Horák
Affiliations:
Department of Digital Signal Processing and Speech Synthesis Institute of Photonics and Electronics, Academy of Sciences of the Czech Republic, Praha 8, Czech Republic CZ-182 51
Venue:
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Year:
2008

Citing 1
Cited 0

Linear Prediction of Speech

Linear Prediction of Speech

Quantified Score

Hi-index	0.00

Visualization

Abstract

Present Czech TTS systems can produce synthetic speech with high intelligibility but low naturalness. The difference between natural and synthetic speech is still too high. Naturalness of the synthetic speech is given by the signal modeling and by the prosody modeling. This paper deals with the improving of the synthetic prosody modeling especially with the improving of the intonation modeling. A mathematical model of the pitch contour modeling can significantly limit the complexity of intonational rules creation and increase the naturalness of resulting synthetic speech. The linear prediction inonational model has been implemented into TTS system Epos for practical use. This built-in inonational model uses excitation by rules and provides in conjunction with a new triphone time domain inventories more naturalness synthetic speech than previous direct intonational rules.