Speech synthesis using stochastic Markov graphs

Authors:
M. Eichner;M. Wolff;S. Ohnewald;R. Hoffmann
Affiliations:
Lab. of Acoust. & Speech Connnunication, Dresden Univ. of Technol., Germany;-;-;-
Venue:
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Year:
2001

Citing 0
Cited 4

Review: Statistical parametric speech synthesis

Speech Communication
Analysis of verbal and nonverbal acoustic signals with the dresden UASR system

COST 2102'07 Proceedings of the 2007 COST action 2102 international conference on Verbal and nonverbal communication behaviours
An overview of text-to-speech synthesis techniques

CIT'10 Proceedings of the 4th international conference on Communications and information technology
An approach to intelligent signal processing

COST'11 Proceedings of the 2011 international conference on Cognitive Behavioural Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech synthesis systems basing on concatenation of natural speech segments achieve a high quality in terms of naturalness and intelligibility. However, in many applications such systems are not easy to apply because of the huge demand for storage capacity. Speech synthesis systems based on HMMs could be an alternative to concatenative speech synthesis systems but do not yet achieve the quality needed for use in applications. In one of our research projects we investigate the possibility of combining speech synthesis and speech recognition to a unified system using the same databases and similar algorithms for synthesis and recognition. In this context we examine the suitability of stochastic Markov graphs instead of HMMs to improve the performance of such synthesis systems. The paper describes the training procedure we used to train the SMGs, explains the synthesis process and introduces an algorithm for state selection and state duration modeling. We focus particularly on issues which arise using SMGs instead of HMMs.