A multi-modal approach for determining speaker location and focus
Proceedings of the 5th international conference on Multimodal interfaces
Private speech during multimodal human-computer interaction
Proceedings of the 6th international conference on Multimodal interfaces
Abstract: In this paper we present our system for speech intent detection. In traditional desktop speech applications, the user must explicitly signal intent-to-speak by turning the microphone on, which avoids the problems an open microphone causes for an automatic speech recognition system. We instead use cues derived from the user's pose, proximity, and visual speech activity to detect speech intent and control the microphone automatically. We achieve real-time performance by using pre-attentive cues to eliminate redundant computation.
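The cascade the abstract describes can be sketched as follows: cheap "pre-attentive" cues (proximity, pose) act as a gate, and the more expensive visual speech-activity computation runs only when that gate passes. All function names, thresholds, and fusion weights below are illustrative assumptions, not the authors' actual implementation.

```python
def speech_intent(proximity_m, facing_camera, lip_motion_fn,
                  max_distance=1.5, threshold=0.5):
    """Return (intent_score, mic_on).

    lip_motion_fn is called only when the cheap cues pass, mirroring
    the idea of using pre-attentive cues to skip redundant computation.
    """
    # Pre-attentive gate: user too far away or not facing the camera
    # implies no intent-to-speak; skip the costly visual-speech step.
    if proximity_m > max_distance or not facing_camera:
        return 0.0, False
    # Expensive cue, computed only inside the gate; assumed in [0, 1].
    lip_score = lip_motion_fn()
    # Simple linear fusion of the gate pass with lip activity
    # (weights are illustrative, not from the paper).
    score = 0.4 + 0.6 * lip_score
    return score, score >= threshold
```

For example, a nearby, camera-facing user with strong lip motion would switch the microphone on, while a distant user is rejected without ever running the visual speech-activity analysis.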