Methods for stress classification: nonlinear TEO and linear speech based features

Authors:
Guojun Zhou;J. H. L. Hansen;J. F. Kaiser
Affiliations:
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA;-;-
Venue:
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 04
Year:
1999

Citing 0
Cited 4

Modeling drivers' speech under stress

Speech Communication - Special issue on speech and emotion
Preliminary study of stress/neutral detection on recordings of children in the natural home environment

Proceedings of the 2nd Workshop on Child, Computer and Interaction
Spoken emotion recognition through optimum-path forest classification using glottal features

Computer Speech and Language
Piecing together the emotion jigsaw

MLMI'04 Proceedings of the First international conference on Machine Learning for Multimodal Interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech production variations due to perceptually induced stress contribute significantly to reduced speech processing performance. One approach that can improve the robustness of speech processing (e.g., recognition) algorithms against stress is to formulate an objective classification of speaker stress based upon the acoustic speech signal. An overview of methods for stress classification is presented. First, we review traditional pitch-based methods for stress detection and classification. Second, neural network based stress classifiers with cepstral-based features, as well as wavelet-based classification algorithms are considered. The effect of stress on linear speech features is discussed, followed by the application of linear features and the Teager (1990) energy operator (TEO) based nonlinear features for effective stress classification. A new evaluation for stress classification and assessment is presented using a critical band frequency partition based the TEO feature and the combination of several linear features. Results using NATO databases of actual speech under stress are presented. Finally, we discuss issues relating to stress classification across known and unknown speakers and suggest areas for further research.