Audio-visual person authentication using lip-motion from orientation maps
Pattern Recognition Letters
Synergy of Lip-Motion and Acoustic Features in Biometric Speech and Speaker Recognition
IEEE Transactions on Computers
Automatic visual feature extraction for mandarin audio-visual speech recognition
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Text driven face-video synthesis using GMM and spatial correlation
SCIA'07 Proceedings of the 15th Scandinavian conference on Image analysis
Lip biometrics for digit recognition
CAIP'07 Proceedings of the 12th international conference on Computer analysis of images and patterns
Person identification using lip motion sequence
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part I
Pyramid based interpolation for face-video playback in audio visual recognition
ICB'07 Proceedings of the 2007 international conference on Advances in Biometrics
Speaker and digit recognition by audio-visual lip biometrics
ICB'07 Proceedings of the 2007 international conference on Advances in Biometrics
Hi-index | 0.00 |
This paper describes a new motion based feature extraction technique for speaker recognition using orientation estimation in 2D manifolds. The motion is estimated by computing the components of the structure tensor from which normal flows are extracted. By projecting the 3D spatiotemporal data to 2-D planes we obtain projection coefficients which we use to evaluate the 3-D orientations of brightness patterns in TV like 2D image sequences. This corresponds to the solutions of simple matrix eigenvalue problems in 2D, affording increased computational efficiency. An implementation based on joint lip movements and speech is presented along with experiments which confirm the theory, exhibiting a recognition rate of 98% on the publicly available XM2VTS database.