The production and recognition of emotions in speech: features and algorithms

  • Authors:
  • Pierre-Yves Oudeyer

  • Affiliations:
  • Sony CSL Paris, 6, rue Amyot, 75005 Paris, France

  • Venue:
  • International Journal of Human-Computer Studies - Applications of affective computing in human-computer interaction
  • Year:
  • 2003

Abstract

This paper presents algorithms that allow a robot to express its emotions by modulating the intonation of its voice. The algorithms are very simple and, thanks to the use of concatenative speech synthesis, efficiently produce life-like speech. We describe a technique that allows continuous control of both the age of a synthetic voice and the amount of emotion it expresses. Also, we present the first large-scale data-mining experiment on the automatic recognition of basic emotions in informal everyday short utterances. We focus on the speaker-dependent problem. We compare a large set of machine learning algorithms, ranging from neural networks and support vector machines to decision trees, together with 200 features, using a large database of several thousand examples. We show that the difference in performance among learning schemes can be substantial, and that some previously unexplored features are of crucial importance. An optimal feature set is derived through the use of a genetic algorithm. Finally, we explain how this study can be applied to real-world situations in which very few examples are available. Furthermore, we describe a game, played with a personal robot, that facilitates the teaching of examples of emotional utterances in a natural and rather unconstrained manner.