Identifying speakers using their emotion cues

Authors:
Ismail Shahin
Affiliations:
Electrical and Computer Engineering Department, University of Sharjah, Sharjah, United Arab Emirates
Venue:
International Journal of Speech Technology
Year:
2011

Citing 7
Cited 2

Speaker-dependent-feature extraction, recognition and processing techniques

Speech Communication - Special issue on speaker characterization in speech terminology
Fundamentals of speech recognition

Fundamentals of speech recognition
Hidden Markov Models for Speech Recognition

Hidden Markov Models for Speech Recognition
Emotions, speech and the ASR framework

Speech Communication - Special issue on speech and emotion
Speaker identification in the shouted environment using Suprasegmental Hidden Markov Models

Signal Processing
Modulation spectral features for robust far-field speaker identification

IEEE Transactions on Audio, Speech, and Language Processing
Emotion-State conversion for speaker recognition

ACII'05 Proceedings of the First international conference on Affective Computing and Intelligent Interaction

Gender-dependent emotion recognition based on HMMs and SPHMMs

International Journal of Speech Technology
Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments

International Journal of Speech Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the formulation of a new speaker identification approach which employs knowledge of emotional content of speaker information. Our proposed approach in this work is based on a two-stage recognizer that combines and integrates both emotion recognizer and speaker recognizer into one recognizer. The proposed approach employs both Hidden Markov Models (HMMs) and Suprasegmental Hidden Markov Models (SPHMMs) as classifiers. In the experiments, six emotions are considered including neutral, angry, sad, happy, disgust and fear. Our results show that average speaker identification performance based on the proposed two-stage recognizer is 79.92% with a significant improvement over a one-stage recognizer with an identification performance of 71.58%. The results obtained based on the proposed approach are close to those achieved in subjective evaluation by human listeners.