Avatar-mediated face tracking and lip reading for human computer interaction

  • Authors:
  • Xiaozhou Wei;Lijun Yin;Zhiwei Zhu;Qiang Ji

  • Affiliations:
  • SUNY at Binghamton, Binghamton, NY;SUNY at Binghamton, Binghamton, NY;Rensselaer Polytech Institute, Troy, NY;Rensselaer Polytech Institute, Troy, NY

  • Venue:
  • Proceedings of the 12th annual ACM international conference on Multimedia
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Advanced human computer interaction requires automatic reading of human face in order to make the computer interact with human in the same way as human-to-human communication. We developed an automatic face tracking and lip reading system through a 3D face avatar to facilitate HCI applications in speech learning, emotional state monitoring, and non-verbal human computer interface design. The system implements a novel active face feature tracking algorithm with an uncalibrated camera. The 3D face pose is estimated and tracked by a Kalman filter-based matching process with a dynamic face model updating and constraint. The obtained facial motion parameters are transferred to an individualized 3D face avatar. As a result, a person's lip shape or expressions can be cloned to the animated 3D face avatar, by which all lip shapes from the same speech of different subjects can be easily compared and measured. This real time system targets the automatic facial expression analysis and synthesis for the next generation of HCI design.