Principal Warps: Thin-Plate Splines and the Decomposition of Deformations
IEEE Transactions on Pattern Analysis and Machine Intelligence
Discrete-time signal processing
Discrete-time signal processing
Using Discriminant Eigenfeatures for Image Retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence
Probabilistic Visual Learning for Object Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
What Is the Set of Images of an Object Under All Possible Illumination Conditions?
International Journal of Computer Vision
Speaker transformation algorithm using segmental codebooks (STASC)
Speech Communication
Detecting Faces in Images: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence
High-resolution voice transformation
High-resolution voice transformation
Face recognition: A literature survey
ACM Computing Surveys (CSUR)
Audiovisual speech synchrony measure: application to biometrics
EURASIP Journal on Applied Signal Processing
A tutorial on text-independent speaker verification
EURASIP Journal on Applied Signal Processing
Journal of Cognitive Neuroscience
The BANCA database and evaluation protocol
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
A comparative evaluation of fusion strategies for multimodal biometric verification
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Multimodal decision-level fusion for person authentication
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Fusion of face and speech data for person identity verification
IEEE Transactions on Neural Networks
Identities, forgeries and disguises
International Journal of Information Technology and Management
Spoken dialogue in virtual worlds
COST'09 Proceedings of the Second international conference on Development of Multimodal Interfaces: active Listening and Synchrony
Hi-index | 0.00 |
The robustness of a biometric identity verification (IV) system is best evaluated by monitoring its behavior under impostor attacks. Such attacks may include the transformation of one, many, or all of the biometric modalities. In this paper, we present the transformation of both speech and visual appearance of a speaker and evaluate its effects on the IV system. We propose MixTrans, a novel method for voice transformation. MixTrans is a mixture-structured bias voice transformation technique in the cepstral domain, which allows a transformed audio signal to be estimated and reconstructed in the temporal domain. We also propose a face transformation technique that allows a frontal face image of a client speaker to be animated. This technique employs principal warps to deform defined MPEG-4 facial feature points based on determined facial animation parameters (FAPs). The robustness of the IV system is evaluated under these attacks.