Ensemble methods for spoken emotion recognition in call-centres

Authors:
Donn Morrison;Ruili Wang;Liyanage C. De Silva
Affiliations:
Institute of Information Sciences and Technology, Massey University (Turitea), Palmerston North, Private Bag 11222, New Zealand;Institute of Information Sciences and Technology, Massey University (Turitea), Palmerston North, Private Bag 11222, New Zealand;Institute of Information Sciences and Technology, Massey University (Turitea), Palmerston North, Private Bag 11222, New Zealand
Venue:
Speech Communication
Year:
2007

Citing 13
Cited 30

Instance-Based Learning Algorithms

Machine Learning
Original Contribution: Stacked generalization

Neural Networks
The nature of statistical learning theory

The nature of statistical learning theory
Selection of relevant features and examples in machine learning

Artificial Intelligence - Special issue on relevance
Fast training of support vector machines using sequential minimal optimization

Advances in kernel methods
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Random Forests

Machine Learning
Recognition of Affective Communicative Intent in Robot-Directed Speech

Autonomous Robots
How to Make Stacking Better and Faster While Also Taking Care of an Unknown Weakness

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
How to find trouble in communication

Speech Communication - Special issue on speech and emotion
Vocal communication of emotion: a review of research paradigms

Speech Communication - Special issue on speech and emotion
Emotion Recognition and Its Application to Computer Agents with Spontaneous Interactive Capabilities

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2
Neural Networks: A Comprehensive Foundation (3rd Edition)

Neural Networks: A Comprehensive Foundation (3rd Edition)

Influence of contextual information in emotion annotation for spoken dialogue systems

Speech Communication
Recognizing low/high anger in speech for call centers

ISPRA'08 Proceedings of the 7th WSEAS International Conference on Signal Processing, Robotics and Automation
On the Use of Kappa Coefficients to Measure the Reliability of the Annotation of Non-acted Emotions

PIT '08 Proceedings of the 4th IEEE tutorial and research workshop on Perception and Interactive Technologies for Speech-Based Systems: Perception in Multimodal Dialogue Systems
Two-Level Fusion to Improve Emotion Classification in Spoken Dialogue Systems

TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Comparing emotions using acoustics and human perceptual dimensions

CHI '09 Extended Abstracts on Human Factors in Computing Systems
Comparison of Classification Methods for Detecting Emotion from Mandarin Speech

IEICE - Transactions on Information and Systems
Spectrum Modification for Emotional Speech Synthesis

Multimodal Signals: Cognitive and Algorithmic Issues
Recognition of Emotions in German Speech Using Gaussian Mixture Models

Multimodal Signals: Cognitive and Algorithmic Issues
Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification

Speech Communication
NMF features for speech emotion recognition

Proceedings of the 2009 International Conference on Hybrid Information Technology
Emotion recognition from speech signals using new harmony features

Signal Processing
Expression of affect in spontaneous speech: Acoustic correlates and automatic detection of irritation and resignation

Computer Speech and Language
Detecting emotional state of a child in a conversational computer game

Computer Speech and Language
Survey on speech emotion recognition: Features, classification schemes, and databases

Pattern Recognition
Classification of emotion in spoken Finnish using vowel-length segments: Increasing reliability with a fusion technique

Speech Communication
F2 -- new technique for recognition of user emotional states in spoken dialogue systems

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Spoken emotion recognition using hierarchical classifiers

Computer Speech and Language
Segment-based emotion recognition from continuous Mandarin Chinese speech

Computers in Human Behavior
Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge

Speech Communication
Formant position based weighted spectral features for emotion recognition

Speech Communication
Enhancement of emotion detection in spoken dialogue systems by combining several information sources

Speech Communication
Multiple feature extraction and hierarchical classifiers for emotions recognition

COST'09 Proceedings of the Second international conference on Development of Multimodal Interfaces: active Listening and Synchrony
Elastic net for paralinguistic speech recognition

Proceedings of the 14th ACM international conference on Multimodal interaction
Cross-validation of bimodal health-related stress assessment

Personal and Ubiquitous Computing
Gender-dependent emotion recognition based on HMMs and SPHMMs

International Journal of Speech Technology
Dimensionality reduction-based spoken emotion recognition

Multimedia Tools and Applications
Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments

International Journal of Speech Technology
Modeling phonetic pattern variability in favor of the creation of robust emotion classifiers for real-life applications

Computer Speech and Language
Class-specific multiple classifiers scheme to recognize emotions from speech signals

Computer Speech and Language
Exploiting Psychological Factors for Interaction Style Recognition in Spoken Conversation

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Machine-based emotional intelligence is a requirement for more natural interaction between humans and computer interfaces and a basic level of accurate emotion perception is needed for computer systems to respond adequately to human emotion. Humans convey emotional information both intentionally and unintentionally via speech patterns. These vocal patterns are perceived and understood by listeners during conversation. This research aims to improve the automatic perception of vocal emotion in two ways. First, we compare two emotional speech data sources: natural, spontaneous emotional speech and acted or portrayed emotional speech. This comparison demonstrates the advantages and disadvantages of both acquisition methods and how these methods affect the end application of vocal emotion recognition. Second, we look at two classification methods which have not been applied in this field: stacked generalisation and unweighted vote. We show how these techniques can yield an improvement over traditional classification methods.