Disambiguating speech commands using physical context

Authors:
Katherine M. Everitt;Susumu Harada;Jeff Bilmes;James A. Landay
Affiliations:
University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA
Venue:
Proceedings of the 9th international conference on Multimodal interfaces
Year:
2007

Citing 10
Cited 0

Fundamentals of speech recognition

Fundamentals of speech recognition
The coming age of calm technolgy

Beyond calculation
Understanding and Using Context

Personal and Ubiquitous Computing
CASIS: a context-aware speech interface system

Proceedings of the 10th international conference on Intelligent user interfaces
Battery-free Wireless Identification and Sensing

IEEE Pervasive Computing
Improving command and control speech recognition on mobile devices: using predictive user models for language modeling

User Modeling and User-Adapted Interaction
Real-time telephone-based speech recognition in the Jupiter domain

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
SketchWizard: Wizard of Oz prototyping of pen-based user interfaces

Proceedings of the 20th annual ACM symposium on User interface software and technology
Contextual coherence in natural language processing

CONTEXT'03 Proceedings of the 4th international and interdisciplinary conference on Modeling and using context
A wirelessly-powered platform for sensing and computation

UbiComp'06 Proceedings of the 8th international conference on Ubiquitous Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech has great potential as an input mechanism for ubiquitous computing. However, the current requirements necessary for accurate speech recognition, such as a quiet environment and a well-positioned and high-quality microphone, are unreasonable to expect in a realistic setting. In a physical environment, there is often contextual information which can be sensed and used to augment the speech signal. We investigated improving speech recognition rates for an electronic personal trainer using knowledge about what equipment was in use as context. We performed an experiment with participants speaking in an instrumented apartment environment and compared the recognition rates of a larger grammar with those of a smaller grammar that is determined by the context.