Sinusoidal model-based analysis and classification of stressed speech

Authors:
S. Ramamohan;S. Dandapat
Affiliations:
Intel Technol. India Pvt. Ltd., Bangalore, India;-
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2006

Citing 0
Cited 5

A non-probabilistic recognizer of stochastic signals based on KLT

Signal Processing
Harmonic model for female voice emotional synthesis

BioID_MultiComm'09 Proceedings of the 2009 joint COST 2101 and 2102 international conference on Biometric ID management and multimodal communication
Improvement to speech-music discrimination using sinusoidal model based features

Multimedia Tools and Applications
Analysis of stressed human speech

International Journal of Computational Vision and Robotics
Spectral slope based analysis and classification of stressed speech

International Journal of Speech Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, a sinusoidal model has been proposed for characterization and classification of different stress classes (emotions) in a speech signal. Frequency, amplitude and phase features of the sinusoidal model are analyzed and used as input features to a stressed speech recognition system. The performances of sinusoidal model features are evaluated for recognition of different stress classes with a vector-quantization classifier and a hidden Markov model classifier. To find the effectiveness of these features for recognition of different emotions in different languages, speech signals are recorded and tested in two languages, Telugu (an Indian language) and English. Average stressed speech index values are proposed for comparing differences between stress classes in a speech signal. Results show that sinusoidal model features are successful in characterizing different stress classes in a speech signal. Sinusoidal features perform better compared to the linear prediction and cepstral features in recognizing the emotions in a speech signal.