Audio-visual identity verification: an introductory overview

  • Authors:
  • Bouchra Abboud, Hervé Bredin, Guido Aversano, Gérard Chollet

  • Affiliations:
  • CNRS-LTCI, GET-ENST, Paris, France (all authors)

  • Venue:
  • Progress in nonlinear speech processing
  • Year:
  • 2007

Abstract

Verification of identity is commonly achieved by looking at a person's face and listening to his or her speech. Automatic means of performing this verification have been studied for several decades. Indeed, a talking face offers many features for robust identity verification. The current deployment of videophones creates new opportunities for secure access to remote servers (banking, certification, call centers, etc.). Synchrony between the speech signal and lip movements is a necessary condition for checking that the observed talking face has not been manipulated and/or synthesized. This overview addresses face, speaker, and talking-face verification, as well as face and voice transformation techniques. It is demonstrated that a dedicated impostor needs only limited information about a client to fool state-of-the-art audio-visual identity verification systems.
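
To make the synchrony condition concrete, below is a minimal, generic sketch, not the method described in the paper: it scores audio-visual synchrony as the correlation between frame-level audio energy and a mouth-opening measurement. The feature extraction step is assumed to have been done already, and the function and variable names are hypothetical.

    # Minimal sketch of an audio-visual synchrony check (illustrative only,
    # not the authors' method). Assumes per-video-frame audio energy and
    # mouth-opening measurements are already extracted.
    import numpy as np

    def synchrony_score(audio_energy: np.ndarray, mouth_opening: np.ndarray) -> float:
        """Pearson-style correlation between audio energy and mouth opening.

        Both inputs are 1-D arrays sampled at the video frame rate. Scores
        near 1 suggest the audio and lip motion vary together; scores near 0
        suggest the streams are independent (e.g. a replayed or synthesized face).
        """
        a = (audio_energy - audio_energy.mean()) / (audio_energy.std() + 1e-8)
        v = (mouth_opening - mouth_opening.mean()) / (mouth_opening.std() + 1e-8)
        return float(np.mean(a * v))

    # Toy usage with synthetic data: a shared "speech activity" signal drives
    # both streams in the genuine case; the impostor case is desynchronized.
    rng = np.random.default_rng(0)
    activity = rng.random(250)
    genuine = synchrony_score(activity + 0.1 * rng.standard_normal(250),
                              activity + 0.1 * rng.standard_normal(250))
    impostor = synchrony_score(activity, rng.random(250))
    print(f"genuine-like score: {genuine:.2f}, desynchronized score: {impostor:.2f}")

In practice a system would threshold such a score (or a more sophisticated joint audio-visual model) to reject recordings where the voice and the visible lip motion do not come from the same live utterance.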