Can we hear the prosody of smile?

  • Authors:
  • Véronique Aubergé; Marie Cathiard

  • Affiliations:
  • Institut de la Communication Parlée, Université Stendhal/INPG, 1180 Av. Centrale, BP 25, 38040 Grenoble Cedex 9, France (both authors)

  • Venue:
  • Speech Communication - Special issue on speech and emotion
  • Year:
  • 2003

Abstract

Visual expression alone (a smile or a laugh) is often enough to identify an emotion such as amusement: in a perception task, subjects correctly identified visual and audio-visual stimuli of amused speech to the same degree (Proceedings of ICSLP, Sydney, 1998, p. 559), but it has also been demonstrated that, from the acoustic signal alone, the consequences of a (mechanical) smile gesture can be perceived as amusement (JASA 96 (1994) 2101). The hypothesis developed in the present work is that the expression of amusement in speech involves specific control of prosody and cannot be reduced to a change in voice quality resulting from the facial smile gesture. Speech stimuli were produced by French speakers under various tasks (spontaneous amusement, simulated amusement, mechanical smiling, ...). In a first experiment, listeners identified speech from the spontaneous smile condition as more amused than the "mechanical smile". A second experiment showed that, even under clear visual conditions, the auditory modality contributes to audio-visual perception: a McGurk paradigm applied to discordant amused/mechanical stimuli clearly showed that acoustic information interacts with visual decoding. The stimuli were analysed using a set of parameters chosen following Tartter (Percept. Psychophys. 27 (1980) 24), Banse and Scherer (J. Pers. Soc. Psychol. 70 (1996) 614) and Mozziconacci (PhD Thesis, Eindhoven University, 1998). The prosodic parameters affected in the expression of amusement are primarily intensity and F0 declination, but they differ across speakers. Our results confirm Mozziconacci's (1998) finding that there may be numerous ways of using the same parameters to express emotions such as amusement.
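
The abstract singles out intensity and F0 declination as the prosodic cues most affected by amusement. As an illustration only, and not the authors' analysis pipeline, the sketch below estimates these two cues from a recording using the librosa library; the file name "amused_utterance.wav" and the 75-400 Hz pitch-search range are assumptions.

```python
# Minimal sketch: estimate F0 declination (slope of a line fitted to the voiced
# F0 contour) and a mean-intensity proxy (RMS energy in dB) for one utterance.
import numpy as np
import librosa

def prosodic_cues(wav_path, fmin=75.0, fmax=400.0):
    y, sr = librosa.load(wav_path, sr=None)

    # Frame-wise F0 with the pYIN tracker; unvoiced frames are returned as NaN.
    f0, voiced_flag, _ = librosa.pyin(y, fmin=fmin, fmax=fmax, sr=sr)
    times = librosa.times_like(f0, sr=sr)

    # F0 declination: slope (Hz per second) of a straight-line fit to the
    # voiced part of the contour.
    voiced = ~np.isnan(f0)
    slope, intercept = np.polyfit(times[voiced], f0[voiced], deg=1)

    # Intensity proxy: mean frame-wise RMS energy converted to dB.
    rms = librosa.feature.rms(y=y)[0]
    mean_db = float(np.mean(librosa.amplitude_to_db(rms, ref=1.0)))

    return {"f0_declination_hz_per_s": float(slope),
            "f0_intercept_hz": float(intercept),
            "mean_rms_db": mean_db}

if __name__ == "__main__":
    print(prosodic_cues("amused_utterance.wav"))  # hypothetical file
```

Comparing these values across the spontaneous, simulated and mechanical-smile conditions, speaker by speaker, mirrors the kind of per-speaker contrast the abstract reports.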