EAVA: A 3D Emotive Audio-Visual Avatar

  • Authors:
  • Hao Tang;Yun Fu;Jilin Tu;Thomas S. Huang;Mark Hasegawa-Johnson

  • Affiliations:
  • University of Illinois at Urbana-Champaign, 405 North Mathews Avenue, Urbana, IL 61801, USA, Email: haotang2@ifp.uiuc.edu.;University of Illinois at Urbana-Champaign, 405 North Mathews Avenue, Urbana, IL 61801, USA, Email: yunfu2@ifp.uiuc.edu.;University of Illinois at Urbana-Champaign, 405 North Mathews Avenue, Urbana, IL 61801, USA, Email: jilintu@ifp.uiuc.edu.;University of Illinois at Urbana-Champaign, 405 North Mathews Avenue, Urbana, IL 61801, USA, Email: huang@ifp.uiuc.edu.;University of Illinois at Urbana-Champaign, 405 North Mathews Avenue, Urbana, IL 61801, USA, Email: hasegawa@ifp.uiuc.edu.

  • Venue:
  • WACV '08 Proceedings of the 2008 IEEE Workshop on Applications of Computer Vision
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Emotive audio-visual avatars have the potential of significantly improving the quality of Human-Computer Interaction (HCI). In this paper, the various technical approaches of a novel framework leading to a text-driven 3D Emotive Audio-Visual Avatar (EAVA) are proposed. Primary work is focused on 3D face modeling, realistic emotional facial expression animation, emotive speech synthesis, and the co-articulation of speech gestures (i.e., lip movements due to speech production) and facial expressions. Experimental results clearly indicate that a certain degree of naturalness and expressiveness has been achieved by EAVA in both audio and visual aspects. Promising potential improvements can be expected by incorporating various data-driven statistical learning models into the framework.