Ensemble methods for spoken emotion recognition in call-centres

  • Authors:
  • Donn Morrison;Ruili Wang;Liyanage C. De Silva

  • Affiliations:
  • Institute of Information Sciences and Technology, Massey University (Turitea), Palmerston North, Private Bag 11222, New Zealand;Institute of Information Sciences and Technology, Massey University (Turitea), Palmerston North, Private Bag 11222, New Zealand;Institute of Information Sciences and Technology, Massey University (Turitea), Palmerston North, Private Bag 11222, New Zealand

  • Venue:
  • Speech Communication
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Machine-based emotional intelligence is a requirement for more natural interaction between humans and computer interfaces and a basic level of accurate emotion perception is needed for computer systems to respond adequately to human emotion. Humans convey emotional information both intentionally and unintentionally via speech patterns. These vocal patterns are perceived and understood by listeners during conversation. This research aims to improve the automatic perception of vocal emotion in two ways. First, we compare two emotional speech data sources: natural, spontaneous emotional speech and acted or portrayed emotional speech. This comparison demonstrates the advantages and disadvantages of both acquisition methods and how these methods affect the end application of vocal emotion recognition. Second, we look at two classification methods which have not been applied in this field: stacked generalisation and unweighted vote. We show how these techniques can yield an improvement over traditional classification methods.