Real-time framework for multimodal human-robot interaction

  • Authors:
  • Jürgen Gast; Alexander Bannat; Tobias Rehrl; Frank Wallhoff; Gerhard Rigoll; Cornelia Wendt; Sabrina Schmidt; Michael Popp; Berthold Färber

  • Affiliations:
  • Institute for Human-Machine Communication, Department of Electrical Engineering and Information Technologies, Technische Universität München, Munich, Germany (Gast, Bannat, Rehrl, Wallhoff, Rigoll); Institut für Arbeitswissenschaften, Fakultät für Luft- und Raumfahrttechnik, Universität der Bundeswehr München, Munich, Germany (Wendt, Schmidt, Popp, Färber)

  • Venue:
  • HSI '09: Proceedings of the 2nd Conference on Human System Interactions
  • Year:
  • 2009

Abstract

This paper presents a new framework for multimodal data processing in real time. The framework comprises modules for different input and output signals and was designed for human-human and human-robot interaction scenarios. Individual modules for recording selected channels, such as speech, gestures, or facial expressions, can be combined with different output options (e.g., robot reactions) in a highly flexible manner. Depending on the included modules, both online and offline data processing are possible. The framework was used to analyze human-human interaction in order to gain insights into the important factors and their dynamics. The recorded data comprise speech, facial expressions, gestures, and physiological signals. These naturally produced data were annotated and labeled in order to train recognition modules, which will be integrated into the existing framework. The overall aim is to create a system that can recognize and react to the parameters that humans take into account during interaction. The paper presents the technical implementation and its application in a human-human and a human-robot interaction scenario.
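
The abstract gives no implementation detail, but the combination pattern it describes can be sketched. The following is a minimal, hypothetical Python illustration, assuming simple InputModule and OutputModule interfaces and a Framework dispatcher; none of these names come from the paper, and the sketch only shows the general idea of freely combining recording channels with output options.

```python
# Hypothetical sketch of a modular multimodal pipeline; the paper's
# actual framework API is not reproduced here.
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Event:
    """A timestamped observation from one input channel."""
    timestamp: float
    channel: str          # e.g. "speech", "gesture", "facial_expression"
    payload: object


class InputModule(ABC):
    """Records one modality and emits events (live or replayed from a log)."""
    @abstractmethod
    def read(self) -> Iterable[Event]: ...


class OutputModule(ABC):
    """Consumes events and produces a reaction (e.g. a robot behavior)."""
    @abstractmethod
    def handle(self, event: Event) -> None: ...


class Framework:
    """Wires arbitrary input modules to arbitrary output modules."""
    def __init__(self) -> None:
        self.inputs: List[InputModule] = []
        self.outputs: List[OutputModule] = []

    def add_input(self, module: InputModule) -> None:
        self.inputs.append(module)

    def add_output(self, module: OutputModule) -> None:
        self.outputs.append(module)

    def run(self) -> None:
        # Offline illustration: merge all recorded input streams by
        # timestamp and fan each event out to every output module.
        # An online run would merge the streams incrementally instead.
        events = sorted(
            (e for m in self.inputs for e in m.read()),
            key=lambda e: e.timestamp,
        )
        for event in events:
            for out in self.outputs:
                out.handle(event)
```

Concrete modules (for instance, a speech recorder or a robot-reaction generator) would subclass these interfaces and be registered on a Framework instance; replaying logged events through the same interfaces covers the offline case the abstract mentions.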