Cross-Lingual Vocal Emotion Recognition in Five Native Languages of Assam Using Eigenvalue Decomposition

Authors:
Aditya Bihar Kandali;Aurobinda Routray;Tapan Kumar Basu
Affiliations:
Department of Electrical Engineering, Indian Institute of Technology, Kharagpur, India PIN Code-721302;Department of Electrical Engineering, Indian Institute of Technology, Kharagpur, India PIN Code-721302;Aliah University, Salt Lake City, Kolkota, India
Venue:
PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
Year:
2009

Citing 4
Cited 0

Digital spectral analysis: with applications

Digital spectral analysis: with applications
Introduction to statistical pattern recognition (2nd ed.)

Introduction to statistical pattern recognition (2nd ed.)
Affective computing

Affective computing
Speech Synthesis and Recognition

Speech Synthesis and Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work investigates whether vocal emotion expressions of full-blown discrete emotions can be recognized cross-lingually. This study will enable us to get more information regarding nature and function of emotion. Furthermore, this work will help in developing a generalized vocal emotion recognition system, which will increase the efficiency required for human-machine interaction systems. An emotional speech database was created with 140 simulated utterances (20 per emotion) per speaker, consisting of short sentences of six full-blown discrete basic emotions and one 'no-emotion' (i.e. neutral) in five native languages (not dialects) of Assam. A new feature set is proposed based on Eigenvalues of Autocorrelation Matrix (EVAM) of each frame of utterance. The Gaussian Mixture Model is used as classifier. The performance of EVAM feature set is compared at two sampling frequencies (44.1 kHz and 8.1 kHz) and with additive white noise with signal-to-noise ratios of 0 db, 5 db, 10 db and 20 db.