Application of Expressive Speech in TTS System with Cepstral Description

  • Authors:
  • Jiří Přibil;Anna Přibilová

  • Affiliations:
  • Institute of Photonics and Electronics, Academy of Sciences CR, v.v.i., Prague 8, Czech Republic CZ-182 51;Faculty of Electrical Engineering & Information Technology, Dept. of Radio Electronics, Slovak University of Technology, Bratislava, Slovakia SK-812 19

  • Venue:
  • Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Expressive speech synthesis representing different human emotions has been in the interests of researchers for a longer time. Recently, some experiments with storytelling speaking style have been performed. This particular speaking style is suitable for applications aimed at children as well as special applications aimed at blind people. Analyzing human storytellers' speech, we designed a set of prosodic parameters prototypes for converting speech produced by the text-to-speech (TTS) system into storytelling speech. In addition to suprasegmental characteristics (pitch, intensity, and duration) included in these speech prototypes, also information about significant frequencies of spectral envelope and spectral flatness determining degree of voicing was used.