Robot Command Interface Using an Audio-Visual Speech Recognition System
CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
In this paper we present a robot command interface, based on voice commands and mouth gestures, that controls three degrees of freedom. The gesture set was designed to avoid head rotation and translation, relying solely on mouth movements. Mouth segmentation uses the normalized a* color component, as in [1]. Gesture detection is performed by a Gaussian Mixture Model (GMM) classifier; a state machine then stabilizes the system response by restricting the set of admissible movements given the current state. Voice commands are modeled with a Hidden Markov Model (HMM) isolated-word recognition scheme. The interface was designed to respect the specific pose restrictions imposed by the da Vinci assisted-surgery command console.
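The combination described above — a per-gesture GMM classifier whose output is filtered by a state machine that only accepts movements reachable from the current state — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the feature space, gesture names, GMM parameters, and transition table are all hypothetical placeholders.

```python
import math

def gmm_loglik(x, components):
    """Log-likelihood of feature vector x under a diagonal-covariance GMM.
    components: list of (weight, means, variances) triples (toy values here)."""
    log_terms = []
    for w, mu, var in components:
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        log_terms.append(ll)
    m = max(log_terms)  # log-sum-exp for numerical stability
    return m + math.log(sum(math.exp(t - m) for t in log_terms))

def classify(x, models):
    """Pick the gesture whose GMM assigns x the highest log-likelihood."""
    return max(models, key=lambda g: gmm_loglik(x, models[g]))

# Hypothetical single-component models for two mouth gestures in a 2-D
# feature space (e.g. mouth-region shape features).
MODELS = {
    "open":   [(1.0, [5.0, 5.0], [1.0, 1.0])],
    "closed": [(1.0, [0.0, 0.0], [1.0, 1.0])],
}

# State machine: from each state, only some gestures are admissible.
ALLOWED = {"idle": {"open"}, "active": {"closed"}}
NEXT = {("idle", "open"): "active", ("active", "closed"): "idle"}

def step(state, x):
    """Classify one frame's features and apply the transition restriction."""
    gesture = classify(x, MODELS)
    if gesture in ALLOWED[state]:
        return NEXT[(state, gesture)], gesture
    return state, None  # reject: gesture not admissible from this state
```

For example, from state `"idle"` a feature vector near the `"open"` model triggers a transition to `"active"`, while one matching `"closed"` is rejected and leaves the state unchanged — the stabilizing effect the abstract attributes to the state machine.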