Toward a rule-based synthesis of emotional speech on linguistic descriptions of perception

  • Authors:
  • Chun-Fang Huang; Masato Akagi

  • Affiliations:
  • School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan (both authors)

  • Venue:
  • ACII '05: Proceedings of the First International Conference on Affective Computing and Intelligent Interaction
  • Year:
  • 2005

Abstract

This paper reports rules for morphing a voice so that it is perceived as containing various primitive features, for example, so that it sounds more “bright” or “dark”. In previous work, we proposed a three-layered model for the perception of emotional speech, whose layers are emotional speech, primitive features, and acoustic features. Through perceptual experiments and acoustic analysis, we established the relationships between the three layers and showed that these relationships are significant. A bottom-up method was then adopted to verify the relationships: we morphed (resynthesized) a speech voice by composing acoustic features in the bottommost layer to produce a voice in which listeners could perceive single or multiple primitive features, which in turn could be perceived as different categories of emotion. Intermediate results show that the relationships in the model built in our previous work are valid.
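
To make the bottom-up step concrete, the sketch below composes two acoustic features from the bottommost layer, an F0 contour and phone durations, according to simple morphing rules attached to a primitive feature such as “bright” or “dark”. This is a minimal illustration only: the rule values, the MORPH_RULES table, and the function names are hypothetical placeholders, not the rules established in the paper.

```python
import numpy as np

# Hypothetical morphing rules mapping a primitive feature ("bright" vs.
# "dark") to relative adjustments of two acoustic features: F0 scaling
# and speaking rate. The numeric values are illustrative placeholders.
MORPH_RULES = {
    "bright": {"f0_scale": 1.15, "rate_scale": 1.10},
    "dark":   {"f0_scale": 0.90, "rate_scale": 0.92},
}

def morph_f0_contour(f0, feature):
    """Scale a per-frame F0 contour (Hz) by the rule for `feature`."""
    rule = MORPH_RULES[feature]
    f0 = np.asarray(f0, dtype=float)
    # Leave unvoiced frames (F0 == 0) untouched.
    return np.where(f0 > 0, f0 * rule["f0_scale"], 0.0)

def morph_durations(phone_durations, feature):
    """Adjust phone durations (s); a higher rate means shorter phones."""
    rule = MORPH_RULES[feature]
    return [d / rule["rate_scale"] for d in phone_durations]

# Example: push a neutral utterance toward a "bright" percept.
f0 = [0, 180, 185, 190, 0, 175, 170]   # Hz per frame; 0 = unvoiced
durs = [0.08, 0.12, 0.10]              # seconds per phone
print(morph_f0_contour(f0, "bright"))
print(morph_durations(durs, "bright"))
```

In the paper’s framework, such rule values would be derived from the experimentally established relationships between the layers rather than fixed by hand, and the modified features would drive resynthesis of the voice.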