Speech Synthesis with Various Emotional Expressions and Speaking Styles by Style Interpolation and Morphing

  • Authors:
  • Makoto Tachibana;Junichi Yamagishi;Takashi Masuko;Takao Kobayashi

  • Affiliations:
  • The authors are with Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama-shi, 226-8502 Japan. E-mail: makoto.tachibana@ip.titech.ac.jp, E-mail: ju ...

  • Venue:
  • IEICE - Transactions on Information and Systems
  • Year:
  • 2005


Abstract

This paper describes an approach to generating speech with emotional expressivity and speaking-style variability. The approach is based on a speaking-style and emotional-expression modeling technique for HMM-based speech synthesis. We first model several representative styles, each of which is a speaking style and/or an emotional expression, in an HMM-based speech synthesis framework. Then, to generate synthetic speech with an intermediate style between representative ones, we synthesize speech from a model obtained by interpolating the representative style models using a model interpolation technique. We assess the style interpolation technique with subjective evaluation tests using four representative styles, i.e., neutral, joyful, sad, and rough, in read speech, and speech synthesized from models obtained by interpolating models for all combinations of two styles. The results show that speech synthesized from an interpolated model has a style in between the two representative ones. Moreover, we can control the degree of expressivity for speaking styles or emotions in synthesized speech by changing the interpolation ratio in interpolation between the neutral and other representative styles. We also show that we can achieve style morphing in speech synthesis, namely, changing style smoothly from one representative style to another by gradually changing the interpolation ratio.
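The core idea of model interpolation and ratio-controlled morphing can be sketched in code. The sketch below is a simplified illustration, not the paper's exact method: it linearly combines the per-state Gaussian means and variances of two style models with a weight vector that sums to one, and sweeps the ratio to imitate morphing (real HMM-based systems such as HTS interpolate full output and duration distributions across tied model structures). All names and the toy parameter values are hypothetical.

```python
import numpy as np

def interpolate_styles(models, ratios):
    """Linearly combine per-state Gaussian parameters of style models.

    models: list of dicts with 'mean' and 'var' numpy arrays
    ratios: interpolation weights, must sum to 1
    Returns a new model dict. Simplified sketch of the paper's
    model-interpolation idea, not its exact formulation.
    """
    assert abs(sum(ratios) - 1.0) < 1e-9, "ratios must sum to 1"
    mean = sum(r * m["mean"] for r, m in zip(ratios, models))
    var = sum(r * m["var"] for r, m in zip(ratios, models))
    return {"mean": mean, "var": var}

def morph(model_a, model_b, steps):
    """Style morphing: sweep the ratio gradually from model_a to model_b."""
    return [interpolate_styles([model_a, model_b], [1.0 - t, t])
            for t in np.linspace(0.0, 1.0, steps)]

# Toy two-dimensional "style models" (hypothetical values).
neutral = {"mean": np.array([0.0, 1.0]), "var": np.array([1.0, 1.0])}
joyful = {"mean": np.array([2.0, 3.0]), "var": np.array([2.0, 2.0])}

# A 50/50 interpolation lands halfway between the two styles.
half = interpolate_styles([neutral, joyful], [0.5, 0.5])
print(half["mean"])  # → [1. 2.]

# Morphing yields a sequence of intermediate models.
sequence = morph(neutral, joyful, steps=5)
print(sequence[0]["mean"], sequence[-1]["mean"])
```

Setting the ratio between the neutral model and another style controls the degree of expressivity, as the abstract describes; intermediate ratios yield intermediate styles.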