In this paper, we present three person identification systems that we developed for the CLEAR evaluations. Two of the systems are each based on a single modality, audio or video, whereas the third uses both modalities. The visual identification system analyzes face images to determine a person's identity, processing multi-view, multi-frame information to produce its estimate. The speaker identification system processes audio data from different channels to determine the speaker's identity. The multi-modal identification system fuses the similarity scores obtained from the audio and video modalities to reach an identity estimate.
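
The abstract does not say how the similarity scores are fused, so the following is only a minimal sketch of one common score-level fusion scheme, not the method used in the paper. It assumes each modality produces one similarity score per enrolled identity, rescales each modality's scores to [0, 1] with min-max normalization, and combines them with a weighted sum. The function names and the `audio_weight` parameter are hypothetical and chosen purely for illustration.

```python
from typing import List, Sequence, Tuple


def min_max_normalize(scores: Sequence[float]) -> List[float]:
    """Rescale scores to [0, 1]; returns all zeros if every score is equal."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def fuse_scores(
    audio_scores: Sequence[float],
    video_scores: Sequence[float],
    audio_weight: float = 0.5,  # assumed weighting; not taken from the paper
) -> Tuple[int, List[float]]:
    """Weighted-sum fusion of per-identity similarity scores.

    Returns the index of the best-scoring identity and the fused scores.
    """
    if len(audio_scores) != len(video_scores):
        raise ValueError("Both modalities must score the same set of identities")
    audio_norm = min_max_normalize(audio_scores)
    video_norm = min_max_normalize(video_scores)
    fused = [
        audio_weight * a + (1.0 - audio_weight) * v
        for a, v in zip(audio_norm, video_norm)
    ]
    best = max(range(len(fused)), key=fused.__getitem__)
    return best, fused


# Illustrative scores for three enrolled identities (made-up numbers).
best_identity, fused_scores = fuse_scores([0.2, 0.9, 0.4], [10.0, 30.0, 50.0])
```

Normalizing before summation matters here because audio and video similarity scores generally live on different scales; without it, the modality with the larger numeric range would dominate the fused estimate regardless of the chosen weight.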