Adaptive fusion of acoustic and visual sources for automatic speech recognition
Speech Communication - Special issue on auditory-visual speech processing
Real-Time Lip Tracking for Audio-Visual Speech Recognition Applications
ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Two-Dimensional PCA: A New Approach to Appearance-Based Face Representation and Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Perspectives on the Contribution of Timbre to Musical Structure
Computer Music Journal
Noise Estimation from a Single Image
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
A tutorial on text-independent speaker verification
EURASIP Journal on Applied Signal Processing
Fuzzy declustering-based vector quantization
Pattern Recognition
An overview of text-independent speaker recognition: From features to supervectors
Speech Communication
Advances in view-invariant human motion analysis: a review
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Hi-index | 0.00 |
As one of non-contact biometric way for user authentication system of partner robots, visual-based recognition methods still suffer from the disturbance of light noise in the applications. Inspiring from the human's capability of compensating visual information (looking) with audio information(hearing), a visual-audio integrating method is proposed to reduce the disturbance of light noise and to improve the recognition accuracy. Combining with the PCA-based face recognition, a two-stage speaker recognition algorithm is used to extract useful personal identity information from speech signals. With the measurement of visual background noise, the visual-audio integrating method is performed to draw the final decision. The proposed method is evaluated on a public visual-audio dataset VidTIMIT and a partner robot authentication system. The results verified the visual-audio integrating method can obtain satisfied recognition results with strong robustness.