This paper proposes a robust bimodal audio-visual digit and speaker recognition system based on lip-motion and speech biometrics. To increase the robustness of digit and speaker recognition, the method uses speaker lip-motion information extracted from low-resolution video sequences (128 × 128 pixels). We investigate a biometric system for digit recognition and speaker identification that combines line-motion estimation with speech information and Support Vector Machines. The acoustic and visual features are fused at the feature level, giving favourable results on the XM2VTS database: digit recognition rates range from 83% to 100%, and speaker recognition reaches 100%.
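As a rough illustration of what fusion "at the feature level" can mean, the sketch below concatenates per-frame acoustic and visual feature vectors into a single vector that a classifier such as a Support Vector Machine could then consume. It is a minimal sketch under stated assumptions, not the system described in the abstract: the function names `standardize` and `fuse_features`, the z-score normalization step, and the toy feature values are all hypothetical, and the abstract does not specify how the two streams are aligned or scaled.

```python
from statistics import mean, pstdev


def standardize(frames):
    """Z-score each feature dimension across frames.

    Assumption: per-modality scaling is applied before fusion so that
    neither modality dominates; the abstract does not state this.
    """
    dims = list(zip(*frames))                    # one tuple per feature dimension
    mus = [mean(d) for d in dims]
    sigmas = [pstdev(d) or 1.0 for d in dims]    # guard against zero variance
    return [[(x - m) / s for x, m, s in zip(frame, mus, sigmas)]
            for frame in frames]


def fuse_features(acoustic, visual):
    """Feature-level fusion: concatenate aligned per-frame vectors."""
    if len(acoustic) != len(visual):
        raise ValueError("modalities must be aligned to the same number of frames")
    return [a + v for a, v in zip(standardize(acoustic), standardize(visual))]


# Toy data (hypothetical): 2 frames, 2 acoustic and 1 visual feature per frame.
acoustic = [[1.0, 2.0], [3.0, 4.0]]
visual = [[0.5], [1.5]]
fused = fuse_features(acoustic, visual)   # each fused frame has 3 features
```

Each fused vector would then be passed to a single classifier, in contrast to decision-level fusion, where separate per-modality classifiers are trained and their outputs combined afterwards.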