Inferring Human Interactions in Meetings: A Multimodal Approach

  • Authors:
  • Zhiwen Yu; Zhiyong Yu; Yusa Ko; Xingshe Zhou; Yuichi Nakamura

  • Affiliations:
  • School of Computer Science, Northwestern Polytechnical University, P.R. China (Zhiwen Yu, Zhiyong Yu, Xingshe Zhou); Academic Center for Computing and Media Studies, Kyoto University, Japan (Yusa Ko, Yuichi Nakamura)

  • Venue:
  • UIC '09 Proceedings of the 6th International Conference on Ubiquitous Intelligence and Computing
  • Year:
  • 2009

Abstract

Social dynamics, such as human interactions, are important for understanding how a conclusion was reached in a meeting and for determining whether the meeting was well organized. In this paper, a multimodal approach is proposed for inferring human semantic interactions in meeting discussions. A human interaction, such as proposing an idea, giving a comment, or expressing a positive opinion, implies a user's role, attitude, or intention toward a topic. Our approach infers human interactions from a variety of audiovisual and high-level features, e.g., gestures, attention, speech tone, speaking time, interaction occasion, and information about the previous interaction. Four inference models, Support Vector Machine (SVM), Bayesian network, Naïve Bayes, and Decision Tree, are selected and compared for human interaction recognition. Our experimental results show that SVM outperforms the other inference models, that human interactions can be inferred with a recognition rate of around 80%, and that the multimodal approach achieves robust and reliable results by leveraging the characteristics of each individual modality.
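To make the pipeline described in the abstract concrete, the sketch below trains and cross-validates three of the four compared models on per-utterance feature vectors. It is a minimal illustration only: the feature encodings, class labels, and synthetic data are assumptions for demonstration, not the authors' actual features or dataset, and the Bayesian network model is omitted because scikit-learn has no built-in implementation of it.

```python
# Hypothetical sketch of the feature-based interaction classification
# described in the abstract (not the authors' implementation).
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
n = 400  # number of utterances (synthetic)

# One feature vector per utterance, loosely following the paper's feature
# list; in the real system these would be extracted from audio and video.
X = np.column_stack([
    rng.integers(0, 4, n),       # gesture category
    rng.integers(0, 5, n),       # attention target (who is looked at)
    rng.normal(200.0, 40.0, n),  # speech tone (e.g., mean pitch in Hz)
    rng.exponential(5.0, n),     # speaking time in seconds
    rng.integers(0, 3, n),       # interaction occasion
    rng.integers(0, 7, n),       # type of the previous interaction
])
# Interaction classes from the abstract, e.g. 0 = propose an idea,
# 1 = give a comment, 2 = positive opinion, 3 = other.
y = rng.integers(0, 4, n)

models = {
    "SVM": make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0)),
    "Naive Bayes": GaussianNB(),
    "Decision Tree": DecisionTreeClassifier(max_depth=5),
}
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5)
    print(f"{name}: mean accuracy {scores.mean():.2f}")
```

Because the labels above are random, all models score near chance (~0.25); with real multimodal features the comparison would reflect the roughly 80% SVM recognition rate the paper reports.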