Audio, video and multimodal person identification in a smart room

Authors:
Jordi Luque; Ramon Morros; Ainara Garde; Jan Anguita; Mireia Farrus; Dušan Macho; Ferran Marqués; Claudi Martínez; Verónica Vilaplana; Javier Hernando
Affiliations:
Universitat Politècnica de Catalunya, Barcelona, Spain (all authors)
Venue:
CLEAR'06: Proceedings of the 1st International Evaluation Conference on Classification of Events, Activities and Relationships
Year:
2006

Abstract

In this paper, we address the problem of modality integration in a smart room environment, with the aim of identifying persons by combining acoustic features and 2D face images. We first introduce the monomodal audio and video identification techniques, and then present the use of combined speech and face-image input for person identification. The two sensory modalities, speech and faces, are processed both individually and jointly. The multimodal approach is shown to improve performance in identifying the participants.
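
The abstract does not state how the audio and video results are combined, so the following is only a minimal sketch of one common option, score-level fusion, and should not be read as the authors' method. It assumes each monomodal system returns one score per enrolled identity (higher meaning a better match); the function names, the min-max normalization, and the `audio_weight` parameter are all hypothetical choices made for illustration.

```python
def min_max_normalize(scores):
    """Rescale a dict of per-identity scores to the [0, 1] range."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: the modality carries no ranking information.
        return {name: 0.5 for name in scores}
    return {name: (s - lo) / (hi - lo) for name, s in scores.items()}


def fuse_scores(audio_scores, video_scores, audio_weight=0.5):
    """Weighted sum of normalized audio and video scores (assumed fusion rule)."""
    audio = min_max_normalize(audio_scores)
    video = min_max_normalize(video_scores)
    # Only identities scored by both modalities can be fused.
    common = audio.keys() & video.keys()
    return {
        name: audio_weight * audio[name] + (1.0 - audio_weight) * video[name]
        for name in common
    }


def identify(audio_scores, video_scores, audio_weight=0.5):
    """Return the identity with the highest fused score."""
    fused = fuse_scores(audio_scores, video_scores, audio_weight)
    return max(fused, key=fused.get)


# Made-up scores for three enrolled identities (illustrative only).
audio = {"alice": -10.0, "bob": -12.0, "carol": -20.0}  # e.g. log-likelihoods
video = {"alice": 0.2, "bob": 0.9, "carol": 0.1}        # e.g. similarities
print(identify(audio, video))
```

Normalizing before summing matters because the two modalities typically produce scores on unrelated scales; setting `audio_weight` to 1.0 or 0.0 reduces the sketch to the audio-only or video-only decision.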