Spoken emotion recognition using hierarchical classifiers

  • Authors:
  • Enrique M. Albornoz, Diego H. Milone, Hugo L. Rufiner

  • Affiliations:
  • All authors: Centro de I+D en Señales, Sistemas e INteligencia Computacional (SINC(i)), Fac. de Ingeniería y Cs. Hídricas, Univ. Nacional del Litoral, Santa Fe, Argentina and Consejo Nacional de ...

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2011


Abstract

The recognition of the emotional state of speakers is a multi-disciplinary research area that has received considerable interest in recent years. One of its most important goals is to improve voice-based human-machine interaction. Most work in this domain applies prosodic features or spectral characteristics of the speech signal to neural networks, Gaussian mixtures, and other standard classifiers, and the types of errors in the results are usually given no acoustic interpretation. In this paper, the spectral characteristics of emotional signals are used to group emotions on acoustic rather than psychological grounds. Standard classifiers based on Gaussian Mixture Models, Hidden Markov Models, and Multilayer Perceptrons are evaluated with different configurations and input features in order to design a new hierarchical method for emotion classification. The proposed multiple-feature hierarchical method for seven emotions, based on spectral and prosodic information, outperforms the standard classifiers with fixed feature sets.
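
To make the hierarchical scheme concrete, below is a minimal sketch of a two-stage classifier: a first stage assigns an utterance's feature vector to a group of acoustically similar emotions, and a second stage discriminates among the emotions within that group. The specific grouping, the use of scikit-learn's MLPClassifier as a stand-in for the paper's classifiers, and all names below are illustrative assumptions, not the authors' implementation.

```python
# Sketch of a two-stage hierarchical emotion classifier.
# Assumption: emotions are pre-grouped by spectral similarity;
# the grouping shown here is hypothetical, not taken from the paper.

import numpy as np
from sklearn.neural_network import MLPClassifier

# Hypothetical acoustic grouping of the seven emotions.
GROUPS = {
    "group_a": ["anger", "joy", "fear"],
    "group_b": ["sadness", "boredom"],
    "group_c": ["neutral", "disgust"],
}
EMOTION_TO_GROUP = {e: g for g, members in GROUPS.items() for e in members}

class HierarchicalEmotionClassifier:
    def __init__(self):
        # Stage 1 predicts the acoustic group; stage 2 holds one
        # classifier per group to separate the emotions inside it.
        self.stage1 = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500)
        self.stage2 = {g: MLPClassifier(hidden_layer_sizes=(32,), max_iter=500)
                       for g in GROUPS}

    def fit(self, X, emotions):
        emotions = np.asarray(emotions)
        groups = np.array([EMOTION_TO_GROUP[e] for e in emotions])
        self.stage1.fit(X, groups)
        for g in GROUPS:
            mask = groups == g
            self.stage2[g].fit(X[mask], emotions[mask])
        return self

    def predict(self, X):
        groups = self.stage1.predict(X)
        preds = np.empty(len(X), dtype=object)
        for g in GROUPS:
            mask = groups == g
            if mask.any():
                preds[mask] = self.stage2[g].predict(X[mask])
        return preds
```

In practice each stage could be fed a different feature set, for example spectral features for the group decision and spectral plus prosodic features within each group, which is the spirit of the multiple-feature hierarchical method summarized in the abstract.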