Bimodal HCI-related affect recognition

Authors:
Zhihong Zeng;Jilin Tu;Ming Liu;Tong Zhang;Nicholas Rizzolo;Zhenqiu Zhang;Thomas S. Huang;Dan Roth;Stephen Levinson
Affiliations:
University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign;University of Illinois at Urbana-Champaign
Venue:
Proceedings of the 6th international conference on Multimodal interfaces
Year:
2004



Abstract

Perhaps the most fundamental application of affective computing will be Human-Computer Interaction (HCI), in which the computer should be able to detect and track the user's affective states and respond accordingly. The human multi-sensor affect system sets the expectation for a multimodal affect analyzer. In this paper, we present our efforts toward audio-visual HCI-related affect recognition. With HCI applications in mind, we take into account some special affective states that indicate users' cognitive/motivational states. Because a facial expression is influenced by both the affective state and the speech content, we apply a smoothing method to extract the affective-state information from facial features. In our fusion stage, a voting method combines the audio and visual modalities so that the final affect recognition accuracy is greatly improved. We test our bimodal affect recognition approach on 38 subjects with 11 HCI-related affect states. The extensive experimental results show that the average person-dependent affect recognition accuracy of our bimodal fusion is almost 90%.
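The abstract names two processing steps, smoothing of facial features and voting-based audio-visual fusion, without giving their details. The following is only a minimal sketch of what such a pipeline could look like, not the authors' method. It assumes a centered moving average as the smoother and a plain majority vote over frame-level labels pooled from both modalities. The function names, the window size, and the label strings are hypothetical.

```python
from collections import Counter


def moving_average(values, window=5):
    """Smooth a 1-D facial-feature track with a centered moving average.

    Assumption: the abstract does not say which smoother is used; a moving
    average stands in here to damp fast, speech-driven fluctuations.
    """
    if window < 1:
        raise ValueError("window must be >= 1")
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        # The window shrinks at the sequence boundaries instead of padding.
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        segment = values[lo:hi]
        smoothed.append(sum(segment) / len(segment))
    return smoothed


def vote(audio_labels, visual_labels):
    """Fuse two modalities by majority vote over pooled frame-level labels.

    Assumption: every frame-level decision counts equally; the voting rule
    in the paper may differ. Ties are broken alphabetically by label.
    """
    counts = Counter(audio_labels) + Counter(visual_labels)
    if not counts:
        raise ValueError("no labels to vote on")
    best = max(counts.values())
    return min(label for label, count in counts.items() if count == best)


if __name__ == "__main__":
    # Hypothetical mouth-opening track that oscillates while the user talks.
    track = [0.0, 10.0, 0.0, 10.0, 0.0]
    print(moving_average(track, window=3))

    # Hypothetical frame-level decisions from each single-modality classifier.
    audio = ["interest", "interest", "boredom"]
    visual = ["boredom", "interest", "interest"]
    print(vote(audio, visual))
```

In this sketch the smoother would run on each facial feature before visual classification, and the vote would run once per utterance on the outputs of the two single-modality classifiers.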