Text driven face-video synthesis using GMM and spatial correlation

  • Authors:
  • Dereje Teferi; Maycel I. Faraj; Josef Bigun

  • Affiliations:
  • School of Information Science, Computer, and Electrical Engineering (IDE), Halmstad University, Halmstad, Sweden (all authors)

  • Venue:
  • SCIA'07: Proceedings of the 15th Scandinavian Conference on Image Analysis
  • Year:
  • 2007

Abstract

Liveness detection is increasingly being incorporated into biometric systems to reduce the risk of spoofing and impersonation. Techniques used include detection of head motion while posing/speaking, iris size under varying illumination, fingerprint sweat, text-prompted speech, and speech-to-lip motion synchronization. In this paper, we propose to build a biometric signal to test the attack resilience of biometric systems by creating a text-driven video synthesis of faces. We synthesize new, realistic-looking video sequences from real image sequences representing utterances of digits. We determine the image sequences for each digit using a GMM-based speech recognizer. Then, depending on the system prompt (a sequence of digits), our method regenerates a video signal to test the attack resilience of a biometric system that asks for random digit utterances to prevent play-back of pre-recorded audio and image data. The discontinuities in the new image sequence, created at the junction of each digit, are removed by a frame-prediction algorithm that uses the well-known block-matching algorithm. Other uses of our results include web-based video communication for electronic commerce and frame interpolation for low-frame-rate video.
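The abstract's first step, selecting the image sequence for each digit with a GMM-based speech recognizer, follows the standard pattern of training one Gaussian mixture per digit class on acoustic feature vectors and classifying an utterance by maximum log-likelihood. The paper does not give implementation details, so the sketch below is a generic illustration of that pattern, not the authors' recognizer; the feature dimensionality, component count, and the `recognize` helper are all assumptions.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def train_digit_models(features_by_digit, n_components=4, seed=0):
    """Fit one GMM per digit class.

    features_by_digit: dict mapping digit label -> (n_frames, n_features)
    array of per-frame acoustic features (e.g. MFCCs). Hypothetical
    interface; the paper does not specify the feature set.
    """
    models = {}
    for digit, feats in features_by_digit.items():
        gmm = GaussianMixture(n_components=n_components, random_state=seed)
        gmm.fit(feats)
        models[digit] = gmm
    return models

def recognize(models, utterance_feats):
    """Classify an utterance as the digit whose GMM assigns it the
    highest mean per-frame log-likelihood (GaussianMixture.score)."""
    return max(models, key=lambda d: models[d].score(utterance_feats))
```

With one model per digit, a prompted sequence of digits can be segmented and matched against the corresponding stored image sequences before concatenation.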
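The smoothing step at digit junctions relies on block matching: each block of the frame after the junction is located in the frame before it by exhaustive search minimizing a distortion measure, and an intermediate frame is synthesized along the estimated motion vectors. The following is a minimal sketch of that generic technique, assuming grayscale frames, sum-of-absolute-differences (SAD) matching, and half-vector interpolation; block size, search range, and both function names are illustrative choices, not the paper's parameters.

```python
import numpy as np

def block_match(prev, curr, block=8, search=4):
    """For each block of `curr`, find the displacement (dy, dx) of the
    best-matching block in `prev` by exhaustive SAD search."""
    h, w = prev.shape
    vectors = np.zeros((h // block, w // block, 2), dtype=int)
    for by in range(h // block):
        for bx in range(w // block):
            y, x = by * block, bx * block
            target = curr[y:y + block, x:x + block].astype(int)
            best_sad, best_v = None, (0, 0)
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    yy, xx = y + dy, x + dx
                    if yy < 0 or xx < 0 or yy + block > h or xx + block > w:
                        continue  # candidate block falls outside the frame
                    cand = prev[yy:yy + block, xx:xx + block].astype(int)
                    sad = np.abs(cand - target).sum()
                    if best_sad is None or sad < best_sad:
                        best_sad, best_v = sad, (dy, dx)
            vectors[by, bx] = best_v
    return vectors

def predict_midframe(prev, curr, vectors, block=8):
    """Synthesize an intermediate frame by copying each block from
    `prev` displaced by half its motion vector (motion-compensated
    interpolation; a crude stand-in for the paper's frame prediction)."""
    out = prev.copy()
    h, w = prev.shape
    for by in range(vectors.shape[0]):
        for bx in range(vectors.shape[1]):
            y, x = by * block, bx * block
            dy, dx = vectors[by, bx] // 2
            yy = min(max(y + dy, 0), h - block)
            xx = min(max(x + dx, 0), w - block)
            out[y:y + block, x:x + block] = prev[yy:yy + block, xx:xx + block]
    return out
```

Inserting one or more predicted frames at each digit boundary in this way hides the temporal discontinuity created by concatenating independently recorded digit sequences.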