Meeting state recognition from visual and aural labels

  • Authors:
  • Jan Cuřín; Pascal Fleury; Jan Kleindienst; Robert Kessl

  • Affiliations:
  • IBM, Prague, Czech Republic (all authors)

  • Venue:
  • MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
  • Year:
  • 2007

Abstract

In this paper we present a meeting state recognizer based on a combination of multi-modal sensor data in a smart room. Our approach trains a statistical model on semantic cues generated by perceptual components, each of which processes the output of one or more sensors. The presented recognizer is designed to work with an arbitrary combination of multi-modal input sensors. We have defined a set of states representing both meeting and non-meeting situations, together with the set of features on which our classification is based. This allows us to model situations such as presentation or break, which provide important information for many applications. Because appropriate multi-modal corpora are currently scarce, we have hand-annotated a set of meeting recordings to verify our statistical classification. We have also compared several statistical classification methods and validated the best-performing one on the hand-annotated corpus of real meeting data.
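The abstract describes classifying discrete meeting states from semantic cues emitted by perceptual components. As a minimal illustrative sketch (not the authors' actual model), the idea can be shown with a naive Bayes classifier over categorical cues; the feature names (`speech`, `projector`) and state labels here are hypothetical placeholders, not taken from the paper:

```python
from collections import defaultdict

# Hypothetical states and cues for illustration only; the paper's
# actual state and feature sets differ.
STATES = ["presentation", "discussion", "break"]

def train(samples):
    """samples: list of (feature_dict, state) pairs from annotated data."""
    state_counts = defaultdict(int)
    feat_counts = defaultdict(int)  # (state, feature, value) -> count
    for feats, state in samples:
        state_counts[state] += 1
        for f, v in feats.items():
            feat_counts[(state, f, v)] += 1
    return state_counts, feat_counts

def classify(model, feats):
    """Pick the state maximizing prior * smoothed feature likelihoods."""
    state_counts, feat_counts = model
    total = sum(state_counts.values())
    best, best_score = None, 0.0
    for s in STATES:
        score = state_counts[s] / total  # class prior
        for f, v in feats.items():
            # add-one smoothing so unseen (state, feature, value)
            # combinations do not zero out the score
            score *= (feat_counts[(s, f, v)] + 1) / (state_counts[s] + 2)
        if score > best_score:
            best, best_score = s, score
    return best

# Toy training set standing in for hand-annotated meeting recordings.
samples = [
    ({"speech": "single", "projector": "on"}, "presentation"),
    ({"speech": "single", "projector": "on"}, "presentation"),
    ({"speech": "multi", "projector": "off"}, "discussion"),
    ({"speech": "none", "projector": "off"}, "break"),
]
model = train(samples)
print(classify(model, {"speech": "single", "projector": "on"}))  # → presentation
```

The sketch captures only the fusion-by-classification idea: any combination of sensors can contribute cues as long as each is reduced to a categorical feature before classification.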