Visual-audio integration for user authentication system of partner robots

Authors:
Jiangtao Cao;Naoyuki Kubota;Ping Li;Honghai Liu
Affiliations:
School of Information and Control Engineering, Liaoning Shihua University, China;Graduate School of System Design, Tokyo Metropolitan University, Japan;School of Information and Control Engineering, Liaoning Shihua University, China;School of Creative Technologies, University of Portsmouth, UK
Venue:
ICIRA'10 Proceedings of the Third international conference on Intelligent robotics and applications - Volume Part II
Year:
2010

Citing 9
Cited 0

Adaptive fusion of acoustic and visual sources for automatic speech recognition

Speech Communication - Special issue on auditory-visual speech processing
Real-Time Lip Tracking for Audio-Visual Speech Recognition Applications

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Two-Dimensional PCA: A New Approach to Appearance-Based Face Representation and Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Perspectives on the Contribution of Timbre to Musical Structure

Computer Music Journal
Noise Estimation from a Single Image

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
A tutorial on text-independent speaker verification

EURASIP Journal on Applied Signal Processing
Fuzzy declustering-based vector quantization

Pattern Recognition
An overview of text-independent speaker recognition: From features to supervectors

Speech Communication
Advances in view-invariant human motion analysis: a review

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews

Quantified Score

Hi-index	0.00

Visualization

Abstract

As one of non-contact biometric way for user authentication system of partner robots, visual-based recognition methods still suffer from the disturbance of light noise in the applications. Inspiring from the human's capability of compensating visual information (looking) with audio information(hearing), a visual-audio integrating method is proposed to reduce the disturbance of light noise and to improve the recognition accuracy. Combining with the PCA-based face recognition, a two-stage speaker recognition algorithm is used to extract useful personal identity information from speech signals. With the measurement of visual background noise, the visual-audio integrating method is performed to draw the final decision. The proposed method is evaluated on a public visual-audio dataset VidTIMIT and a partner robot authentication system. The results verified the visual-audio integrating method can obtain satisfied recognition results with strong robustness.