Investigating the use of formant based features for detection of affective dimensions in speech

  • Authors:
  • Jonathan C. Kim;Hrishikesh Rao;Mark A. Clements

  • Affiliations:
  • School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta GA;School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta GA;School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta GA

  • Venue:
  • ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The ability of a machine to discern various categories of emotion is of great interest in many applications. This paper attempts to explore the use of baseline features consisting of prosodic and spectral features along with formant based features for the purpose of classification of emotion along the dimensions of arousal, valence, expectancy, and power. Using three feature selection criteria namely maximum average recall, maximal relevance, and minimal-redundancy-maximal-relevance, the paper intends to find the criterion that gives the highest unweighted accuracy. Using a Gaussian Mixture Model classifier, the results indicate that the formant based features show a statistically significant improvement on the accuracy of the classification system.