Emotion Recognition through Multiple Modalities: Face, Body Gesture, Speech

Authors:
Ginevra Castellano;Loic Kessous;George Caridakis
Affiliations:
InfoMus Lab, DIST - University of Genova, Genova, Italy I-16145;Department of Speech, Language and Hearing, University of Tel Aviv, Sheba Center, Tel Aviv, Israel 52621;Image, Video and Multimedia Systems Laboratory, National Technical University of Athens, Athens, Greece 15780
Venue:
Affect and Emotion in Human-Computer Interaction
Year:
2008

Citing 16
Cited 16

Affective computing

Affective computing
Automatic Analysis of Facial Expressions: The State of the Art

IEEE Transactions on Pattern Analysis and Machine Intelligence
Toward Machine Emotional Intelligence: Analysis of Affective Physiological State

IEEE Transactions on Pattern Analysis and Machine Intelligence - Graph Algorithms and Computer Vision
Emotional speech: towards a new generation of databases

Speech Communication - Special issue on speech and emotion
Emotional posturing: a method towards achieving emotional figure animation

CA '97 Proceedings of the Computer Animation
Recognizing emotion from dance movement: comparison of spectator recognition and automated techniques

International Journal of Human-Computer Studies - Application of affective computing in human—Computer interaction
Analysis of emotion recognition using facial expressions, speech and multimodal information

Proceedings of the 6th international conference on Multimodal interfaces
Affective multimodal human-computer interaction

Proceedings of the 13th annual ACM international conference on Multimedia
2005 Special Issue: Emotion recognition through facial expression analysis based on a neurofuzzy network

Neural Networks - Special issue: Emotion and brain
A Bimodal Face and Body Gesture Database for Automatic Analysis of Human Nonverbal Affective Behavior

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 01
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Bi-modal emotion recognition from expressive face and body gestures

Journal of Network and Computer Applications
Parameterized facial expression synthesis based on MPEG-4

EURASIP Journal on Applied Signal Processing
Recognising Human Emotions from Body Movement and Gesture Dynamics

ACII '07 Proceedings of the 2nd international conference on Affective Computing and Intelligent Interaction
On biases in estimating multi-valued attributes

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Generalization of a vision-based computational model of mind-reading

ACII'05 Proceedings of the First international conference on Affective Computing and Intelligent Interaction

The Composite Sensing of Affect

Affect and Emotion in Human-Computer Interaction
Measuring Cognitive Workload in Non-military Scenarios Criteria for Sensor Technologies

FAC '09 Proceedings of the 5th International Conference on Foundations of Augmented Cognition. Neuroergonomics and Operational Neuroscience: Held as Part of HCI International 2009
EmoHeart: conveying emotions in second life based on affect sensing from text

Advances in Human-Computer Interaction - Special issue on emotion-aware natural interaction
Affect- and behaviour-related assistance for families in the home environment

Proceedings of the 3rd International Conference on PErvasive Technologies Related to Assistive Environments
Hybrid fusion approach for detecting affects from multichannel physiology

ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part I
Adaptive facial expression recognition using inter-modal top-down context

ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Evaluation of user gestures in multi-touch interaction: a case study in pair-programming

ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Human face analysis: from identity to emotion and intention recognition

ICEB'10 Proceedings of the Third international conference on Ethics and Policy of Biometrics and International Data Sharing
Monitoring affect states during effortful problem solving activities

International Journal of Artificial Intelligence in Education
Detecting changing emotions in natural speech

IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
Consistent but modest: a meta-analysis on unimodal and multimodal affect detection accuracies from 30 studies

Proceedings of the 14th ACM international conference on Multimodal interaction
Mining for motivation: using a single wearable accelerometer to detect people's interests

Proceedings of the 2nd ACM international workshop on Interactive multimedia on mobile and portable devices
Affect detection from text-based virtual improvisation and emotional gesture recognition

Advances in Human-Computer Interaction
Systematic evaluation of social behaviour modelling with a single accelerometer

Proceedings of the 2013 ACM conference on Pervasive and ubiquitous computing adjunct publication
Towards affect sensitive and socially perceptive companions

Your Virtual Butler
Detecting changing emotions in human speech by machine and humans

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present a multimodal approach for the recognition of eight emotions. Our approach integrates information from facial expressions, body movement and gestures and speech. We trained and tested a model with a Bayesian classifier, using a multimodal corpus with eight emotions and ten subjects. Firstly, individual classifiers were trained for each modality. Next, data were fused at the feature level and the decision level. Fusing the multimodal data resulted in a large increase in the recognition rates in comparison with the unimodal systems: the multimodal approach gave an improvement of more than 10% when compared to the most successful unimodal system. Further, the fusion performed at the feature level provided better results than the one performed at the decision level.