Speech emotional recognition using global and time sequence structure features with MMD

  • Authors:
  • Li Zhao;Yujia Cao;Zhiping Wang;Cairong Zou

  • Affiliations:
  • Research Center of Learning Science, Southeast University, Nanjing, China;Research Center of Learning Science, Southeast University, Nanjing, China;Department of Radio Engineering, Southeast University, Nanjing, China;Department of Radio Engineering, Southeast University, Nanjing, China

  • Venue:
  • ACII'05 Proceedings of the First international conference on Affective Computing and Intelligent Interaction
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, combined features of global and time-sequence were used as the characteristic parameters for speech emotional recognition. A new method based on formula of MMD (Modified Mahalanobis Distance) was proposed to decrease the estimated errors and simplify the calculation. Four emotions including happiness, anger, surprise and sadness are considered in the paper. 1000 recognizing sentences collected from 10 speakers were used to demonstrate the effectiveness of the new method. The average emotion recognition rate reached at 95%. Comparison with method of MQDF [1] (Modified quadratic discriminant function), Data analysis also displayed that the MMD is better than MQDF.