Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces
IEEE Transactions on Pattern Analysis and Machine Intelligence
Time and frequency filtering of filter-bank energies for robust HMM speech recognition
Speech Communication - Special issue on noise robust ASR
Person Identification Using Multiple Cues
IEEE Transactions on Pattern Analysis and Machine Intelligence
Guide to Biometrics
Person identification using automatic integration of speech, lip, and face experts
WBMA '03 Proceedings of the 2003 ACM SIGMM workshop on Biometrics methods and applications
Improved audio-visual speaker recognition via the use of a hybrid combination strategy
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Context Awareness Triggered by Multiple Perceptual Analyzers
Proceedings of the 2007 conference on Emerging Artificial Intelligence Applications in Computer Engineering: Real Word AI Systems with Applications in eHealth, HCI, Information Retrieval and Pervasive Technologies
Multimodal identification and tracking in smart environments
Personal and Ubiquitous Computing
Enhancing biometric recognition with spatio-temporal reasoning in smart environments
Personal and Ubiquitous Computing
Hi-index | 0.00 |
In this paper, we address the modality integration issue on the example of a smart room environment aiming at enabling person identification by combining acoustic features and 2D face images. First we introduce the monomodal audio and video identification techniques and then we present the use of combined input speech and face images for person identification. The various sensory modalities, speech and faces, are processed both individually and jointly. It's shown that the multimodal approach results in improved performance in the identification of the participants.