Social network analysis has become a common technique for modeling and quantifying the properties of social interactions. In this paper, we propose an integrated framework to explore the characteristics of a social network extracted from multimodal dyadic interactions. First, speech detection is performed through an audio/visual fusion scheme based on stacked sequential learning. In the audio domain, speech is detected by clustering audio features; each cluster is modeled by a one-state Hidden Markov Model containing a diagonal-covariance Gaussian Mixture Model. In the visual domain, speech detection is performed through differential-based feature extraction from the segmented mouth region, followed by a dynamic programming matching procedure. Second, to model the dyadic interactions, we employ the Influence Model, whose states encode the previously integrated audio/visual data. Third, the social network is extracted from the estimated influences. For our study, we use a set of videos from the New York Times' Blogging Heads opinion blog. Results are reported both in terms of the accuracy of the audio/visual data fusion and the centrality measures used to characterize the social network.
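To illustrate the final step, the minimal sketch below computes simple degree-style centrality measures from an estimated influence matrix. The matrix values and the normalization are illustrative assumptions, not the paper's exact formulation; the actual influence estimates would come from the fitted Influence Model.

```python
# Hypothetical influence matrix for three participants (made-up values):
# influence[i][j] is the estimated influence of participant i on participant j.
influence = [
    [0.0, 0.6, 0.4],
    [0.3, 0.0, 0.7],
    [0.5, 0.2, 0.0],
]

n = len(influence)

def out_strength(i):
    """Total influence participant i exerts on the others."""
    return sum(influence[i][j] for j in range(n) if j != i)

def in_strength(j):
    """Total influence the others exert on participant j."""
    return sum(influence[i][j] for i in range(n) if i != j)

# Degree-style centrality: strength normalized by the maximum
# possible number of ties a node can have, n - 1.
out_centrality = [out_strength(i) / (n - 1) for i in range(n)]
in_centrality = [in_strength(j) / (n - 1) for j in range(n)]
```

A participant with high out-centrality drives the conversation (influences others), while high in-centrality marks a participant who is strongly influenced; richer measures such as betweenness or eigenvector centrality would be computed on the same weighted directed graph.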