Talking-face identity verification, audiovisual forgery, and robustness issues

Authors:
Walid Karam; Hervé Bredin; Hanna Greige; Gérard Chollet; Chafic Mokbel
Affiliations:
Computer Science Department, University of Balamand, El-Koura, Lebanon; SAMoVA Team, IRIT-UMR, CNRS, Toulouse, France; Mathematics Department, University of Balamand, El-Koura, Lebanon; TSI, Ecole Nationale Supérieure des Télécommunications, Paris, France; Computer Science Department, University of Balamand, El-Koura, Lebanon
Venue:
EURASIP Journal on Advances in Signal Processing - Special issue on recent advances in biometric systems: a signal processing perspective
Year:
2009

Abstract

The robustness of a biometric identity verification (IV) system is best evaluated by monitoring its behavior under impostor attacks. Such attacks may include the transformation of one, many, or all of the biometric modalities. In this paper, we present the transformation of both speech and visual appearance of a speaker and evaluate its effects on the IV system. We propose MixTrans, a novel method for voice transformation. MixTrans is a mixture-structured bias voice transformation technique in the cepstral domain, which allows a transformed audio signal to be estimated and reconstructed in the temporal domain. We also propose a face transformation technique that allows a frontal face image of a client speaker to be animated. This technique employs principal warps to deform defined MPEG-4 facial feature points based on determined facial animation parameters (FAPs). The robustness of the IV system is evaluated under these attacks.
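
The abstract states that the face transformation uses principal warps to deform MPEG-4 facial feature points. As a rough illustration only, the following sketch fits a two-dimensional thin-plate spline, the interpolant on which principal warps are built, to a handful of control points and uses it to move other points. It is not the authors' implementation: the function names `fit_tps` and `apply_tps` and the landmark coordinates are hypothetical, and the points are not actual MPEG-4 feature point positions or FAP-driven displacements.

```python
import numpy as np


def _tps_kernel(r2):
    """Radial basis U(r) = r^2 * log(r^2), taking squared distances, with U(0) = 0."""
    out = np.zeros_like(r2)
    mask = r2 > 0
    out[mask] = r2[mask] * np.log(r2[mask])
    return out


def fit_tps(src, dst):
    """Fit a 2D thin-plate spline mapping control points src -> dst.

    src, dst: (n, 2) arrays of distinct, non-collinear points.
    Returns (src, w, a): control points, non-affine weights (n, 2),
    and affine coefficients (3, 2).
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    n = src.shape[0]

    # Pairwise squared distances between control points.
    d2 = np.sum((src[:, None, :] - src[None, :, :]) ** 2, axis=-1)
    K = _tps_kernel(d2)
    P = np.hstack([np.ones((n, 1)), src])  # affine part: [1, x, y]

    # Assemble and solve the standard TPS linear system.
    L = np.zeros((n + 3, n + 3))
    L[:n, :n] = K
    L[:n, n:] = P
    L[n:, :n] = P.T
    rhs = np.zeros((n + 3, 2))
    rhs[:n] = dst
    params = np.linalg.solve(L, rhs)
    return src, params[:n], params[n:]


def apply_tps(model, pts):
    """Warp arbitrary points (m, 2) with a fitted thin-plate spline."""
    src, w, a = model
    pts = np.asarray(pts, dtype=float)
    d2 = np.sum((pts[:, None, :] - src[None, :, :]) ** 2, axis=-1)
    U = _tps_kernel(d2)
    P = np.hstack([np.ones((pts.shape[0], 1)), pts])
    return U @ w + P @ a


# Hypothetical landmarks: four fixed corners and one displaced interior point.
landmarks = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]])
displaced = landmarks.copy()
displaced[4] += [0.0, 0.1]  # move the interior landmark slightly

model = fit_tps(landmarks, displaced)
warped = apply_tps(model, landmarks)
```

In a face-animation setting, each pixel coordinate of the still image would be passed through a warp of this kind so that the image follows the displaced feature points; how the displacements are derived from FAPs is described in the paper itself and is not reproduced here.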