Emotion Recognition Based on Joint Visual and Audio Cues

Authors:
Nicu Sebe;Ira Cohen;Theo Gevers;Thomas S. Huang
Affiliations:
University of Amsterdam, The Netherlands;HP Labs, USA;University of Amsterdam, The Netherlands;University of Illinois at Urbana-Champaign, USA
Venue:
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 01
Year:
2006

Citing 0
Cited 22

Multimodal human-computer interaction: A survey

Computer Vision and Image Understanding
Affective multimodal mirror: sensing and eliciting laughter

Proceedings of the international workshop on Human-centered multimedia
A rank algebra to support multimedia mining applications

Proceedings of the 8th international workshop on Multimedia data mining: (associated with the ACM SIGKDD 2007)
Psychological responses to simulated displays of mismatched emotional expressions

Interacting with Computers
The emotion expression robot through the affective interaction: KOBIE

Proceedings of the 1st international conference on Robot communication and coordination
Affective Content Detection by Using Timing Features and Fuzzy Clustering

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Automatic temporal segment detection and affect recognition from face and body display

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on human computing
Affective Artificial Intelligence in Education: From Detection to Adaptation

Proceedings of the 2009 conference on Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling
Applying Affect Recognition in Serious Games: The PlayMancer Project

MIG '09 Proceedings of the 2nd International Workshop on Motion in Games
Unobtrusive multimodal emotion detection in adaptive interfaces: speech and facial expressions

FAC'07 Proceedings of the 3rd international conference on Foundations of augmented cognition
A motivational health companion in the home as part of an intelligent health monitoring sensor network

Proceedings of the 3rd international workshop on Affective interaction in natural environments
Evaluating multimodal affective fusion using physiological signals

Proceedings of the 16th international conference on Intelligent user interfaces
Recognizing affect from speech prosody using hierarchical graphical models

Speech Communication
Emotion recognition using bimodal data fusion

Proceedings of the 12th International Conference on Computer Systems and Technologies
Towards multimodal sentiment analysis: harvesting opinions from the web

ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Affect recognition based on physiological changes during the watching of music videos

ACM Transactions on Interactive Intelligent Systems (TiiS) - Special Issue on Affective Interaction in Natural Environments
The new italian audio and video emotional database

COST'09 Proceedings of the Second international conference on Development of Multimodal Interfaces: active Listening and Synchrony
Human face analysis: from identity to emotion and intention recognition

ICEB'10 Proceedings of the Third international conference on Ethics and Policy of Biometrics and International Data Sharing
Towards sensing the influence of visual narratives on human affect

Proceedings of the 14th ACM international conference on Multimodal interaction
A multimodal fuzzy inference system using a continuous facial expression representation for emotion detection

Proceedings of the 14th ACM international conference on Multimodal interaction
A robust joint face model for human emotion recognition

Proceedings of the 27th Conference on Image and Vision Computing New Zealand
Inferring mood in ubiquitous conversational video

Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent technological advances have enabled human users to interact with computers in ways previously unimaginable. Beyond the confines of the keyboard and mouse, new modalities for human-computer interaction such as voice, gesture, and force-feedback are emerging. However, one necessary ingredient for natural interaction is still missing - emotions. This paper describes the problem of bimodal emotion recognition and advocates the use of probabilistic graphical models when fusing the different modalities. We test our audio-visual emotion recognition approach on 38 subjects with 11 HCI-related affect states. The experimental results show that the average person-dependent emotion recognition accuracy is greatly improved when both visual and audio information are used in classification.