Emotion recognition and conversion for mandarin speech

  • Authors:
  • Yu Zhou;Jianping Zhang;Ling Wang;Yonghong Yan

  • Affiliations:
  • ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences;ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences;ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences;ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences

  • Venue:
  • FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this study, some research activities on expressive speech recognition and conversion will be introduced. A database consisting of five kinds of speech emotions (i.e. happiness, sadness, surprise, anger and neutral) is used. Not only those traditional features such as mfcc, plp, and pitch are studied, but also a new feature extraction method based on fisher's F-Ratio is proposed and reported. In our experiments, various combinations of these features, including their high order features are applied using GMM modeling for Mandarin expressive speech recognition. Also we presented some results from emotional speech conversion with a pitch target model.