The use of acoustic-prosodic features related to F0, duration, and voice quality is proposed and evaluated for automatic extraction of paralinguistic information (intentions, attitudes, and emotions) in dialogue speech. Perceptual experiments and acoustic analyses were conducted on monosyllabic interjections spoken in several speaking styles, conveying a variety of paralinguistic information. Experimental results indicated that the classical prosodic features, i.e., F0 and duration, were effective for discriminating groups of paralinguistic information expressing intentions, such as affirm, deny, filler, and ask for repetition, and accounted for 57% of the global detection rate in a task of discriminating seven groups of paralinguistic information. In contrast, voice quality features were effective for identifying part of the paralinguistic information expressing emotions or attitudes, such as surprised, disgusted, and admired, leading to a 12% improvement in the global detection rate.
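As a rough illustration of what "classical prosodic features" can look like in practice, the sketch below derives a duration, a mean F0, and an F0 slope from a frame-wise F0 contour of a single interjection. It is not the feature set used in the study, which the abstract does not specify. The function name `prosodic_features`, the 10 ms frame shift, the 100 Hz semitone reference, and the convention that unvoiced frames are marked with 0 are all assumptions made for this example.

```python
import math


def prosodic_features(f0_hz, frame_shift_s=0.01, ref_hz=100.0):
    """Summarise a frame-wise F0 contour (hypothetical helper, not from the paper).

    f0_hz: one F0 value in Hz per frame; values <= 0 are treated as unvoiced.
    Returns the total duration in seconds, the mean F0 in semitones relative
    to ref_hz, and the least-squares F0 slope in semitones per second.
    """
    duration_s = len(f0_hz) * frame_shift_s

    # Keep voiced frames only, as (time in seconds, F0 in semitones) pairs.
    points = [
        (i * frame_shift_s, 12.0 * math.log2(f / ref_hz))
        for i, f in enumerate(f0_hz)
        if f > 0
    ]
    if len(points) < 2:
        # Too few voiced frames to estimate a mean and a slope.
        return {"duration_s": duration_s, "mean_f0_st": None,
                "f0_slope_st_per_s": None}

    n = len(points)
    t_mean = sum(t for t, _ in points) / n
    st_mean = sum(s for _, s in points) / n

    # Ordinary least-squares slope of semitones against time.
    num = sum((t - t_mean) * (s - st_mean) for t, s in points)
    den = sum((t - t_mean) ** 2 for t, _ in points)

    return {"duration_s": duration_s, "mean_f0_st": st_mean,
            "f0_slope_st_per_s": num / den}
```

A flat contour at 200 Hz, for instance, has a mean of 12 semitones above the assumed 100 Hz reference and a slope of zero. Rising and falling contours differ in the sign of the slope, which is one simple way an F0 feature might separate a question-like interjection from an affirming one. Voice quality measures such as breathiness or creak would need separate analysis and are not covered here.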