Low cost remote gaze gesture recognition in real time

  • Authors:
  • David Rozado; Francisco B. Rodriguez; Pablo Varona

  • Affiliations:
  • Escuela Politécnica Superior, Calle Francisco Tomás y Valiente, 11, Universidad Autónoma de Madrid, Madrid 28049, Spain (all authors)

  • Venue:
  • Applied Soft Computing
  • Year:
  • 2012


Abstract

Predefined sequences of eye movements, or 'gaze gestures', can be consciously performed by humans and monitored non-invasively using remote video oculography. Gaze gestures hold great potential in human-computer interaction (HCI), provided that they can be easily assimilated by potential users, monitored with low cost gaze tracking equipment, and distinguished by machine learning algorithms from the typical gaze activity performed during standard HCI. In this work, the performance of a bioinspired Bayesian pattern recognition algorithm known as Hierarchical Temporal Memory (HTM) on the real time recognition of gaze gestures is evaluated through a user study. To improve the performance of traditional HTM during real time recognition, an extension of the algorithm is proposed in order to adapt HTM to the temporal structure of gaze gestures. The extension consists of an additional top node in the HTM topology that stores and compares sequences of input data by sequence alignment using dynamic programming. The spatio-temporal codification of a gesture as a sequence handles the temporal evolution of gaze gesture instances. The extended HTM allows for reliable discrimination of intentional gaze gestures from otherwise standard human-machine gaze interaction, reaching up to 98% recognition accuracy for a data set of 10 categories of gaze gestures, acceptable completion speeds and a low rate of false positives during standard gaze-computer interaction. These positive results, despite the low cost hardware employed, support the notion of using gaze gestures as a new HCI paradigm for the fields of accessibility and interaction with smartphones, tablets, projected displays and traditional desktop computers.
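
To illustrate the sequence-comparison idea mentioned in the abstract (the top node that compares stored and observed sequences by sequence alignment with dynamic programming), the sketch below shows one possible, simplified realization in Python. It is not the authors' implementation: the gesture alphabet (single characters for saccade directions), the template gestures, and the normalized-distance threshold are all illustrative assumptions.

```python
# Minimal sketch: comparing symbol-coded gaze gesture sequences with a
# Levenshtein-style dynamic programming alignment. Alphabet, templates and
# threshold are hypothetical, chosen only to demonstrate the mechanism.

def alignment_distance(seq_a, seq_b):
    """Edit distance between two gesture symbol sequences (dynamic programming)."""
    n, m = len(seq_a), len(seq_b)
    # dp[i][j] = minimal cost to align seq_a[:i] with seq_b[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dp[i][0] = i
    for j in range(m + 1):
        dp[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if seq_a[i - 1] == seq_b[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,          # deletion
                           dp[i][j - 1] + 1,          # insertion
                           dp[i - 1][j - 1] + cost)   # match / substitution
    return dp[n][m]


def classify_gesture(observed, templates, max_normalized_distance=0.3):
    """Return the best-matching gesture category, or None if the observed
    sequence is too far from every template (i.e. it looks like ordinary
    gaze activity rather than an intentional gesture)."""
    best_label, best_score = None, float("inf")
    for label, template in templates.items():
        d = alignment_distance(observed, template)
        score = d / max(len(observed), len(template))  # length-normalized
        if score < best_score:
            best_label, best_score = label, score
    return best_label if best_score <= max_normalized_distance else None


if __name__ == "__main__":
    # Hypothetical templates: R/D/L/U stand for right/down/left/up segments.
    templates = {"gesture_1": list("RDLU"), "gesture_2": list("RLRL")}
    print(classify_gesture(list("RDLLU"), templates))  # -> gesture_1 (close match)
    print(classify_gesture(list("UUUU"), templates))   # -> None (rejected)
```

The rejection threshold plays the role hinted at in the abstract: it is what keeps false positives low during standard gaze-computer interaction, since sequences that do not align well with any stored gesture are discarded rather than forced into a category.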