Dimension reduction by local principal component analysis
Neural Computation
Coupled hidden Markov models for complex action recognition
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Noise adaptive stream weighting in audio-visual speech recognition
EURASIP Journal on Applied Signal Processing
An introduction to biometric recognition
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.00 |
Identity recognition in real environment with reliable mode is a key issue in human computer interaction (HCI). In this paper, we present a robust speaker identification system considering score based optimal reliability measure of different modalities. We propose an extension of the modified convection function's optimizing parameter to account optimal reliability simultaneously via audio and lip information based reliability measure in bimodal speaker identification system for robust speaker identification. For degradation of visual signals, we have applied JPEG compression to test images. In addition, for creating mismatch in between enrollment and test session, acoustic Babble noises and artificial illumination have been added to test audio and visual signals, respectively. Local PCA has been used to both modalities for reducing the dimension of feature vector. We have applied a swarm intelligence algorithm i.e., particle swarm optimization for optimizing the modified convection function's optimizing parameters. The overall speaker identification experiments are performed using VidTimit DB. Experimental results show that our proposed optimal reliability measures have effectively enhanced the identification accuracy of 7.73% in comparison with the best classifier system in the integration system and maintains the modality reliability statistics in term of its performance thus verified the consistency of the proposed extension.