Partially Observable Markov Decision Process (POMDP) Technologies for Sign Language Based Human-Computer Interaction

Authors:
Sylvie C. Ong;David Hsu;Wee Sun Lee;Hanna Kurniawati
Affiliations:
School of Computing, National University of Singapore, Singapore 117417;School of Computing, National University of Singapore, Singapore 117417;School of Computing, National University of Singapore, Singapore 117417;School of Computing, National University of Singapore, Singapore 117417
Venue:
UAHCI '09 Proceedings of the 5th International Conference on Universal Access in Human-Computer Interaction. Part III: Applications and Services
Year:
2009

Citing 3
Cited 0

Recent developments in visual sign language recognition

Universal Access in the Information Society
Planning and acting in partially observable stochastic domains

Artificial Intelligence
Scaling POMDPs for Spoken Dialog Management

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Sign language (SL) recognition modules in human-computer interaction systems need to be both fast and reliable. In cases where multiple sets of features are extracted from the SL data, the recognition system can speed up processing by taking only a subset of extracted features as its input. However, this should not be realised at the expense of a drop in recognition accuracy. By training different recognizers for different subsets of features, we can formulate the problem as the task of planning the sequence of recognizer actions to apply to SL data, while accounting for the trade-off between recognition speed and accuracy. Partially observable Markov decision processes (POMDPs) provide a principled mathematical framework for such planning problems. A POMDP explicitly models the probabilities of observing various outputs from the individual recognizers and thus maintains a probability distribution (or belief) over the set of possible SL input sentences. It then computes a policy that maps every belief to an action. This allows the system to select actions in real-time during online policy execution, adapting its behaviour according to the observations encountered. We illustrate the POMDP approach with a simple sentence recognition problem and show in experiments the advantages of this approach over "fixed action" systems that do not adapt their behaviour in real-time.