Analysis of acoustic features in speakers with cognitive disorders and speech impairments

Authors:
Oscar Saz;Javier Simón;W.-Ricardo Rodríguez;Eduardo Lleida;Carlos Vaquero
Affiliations:
Communications Technology Group, Aragón Institute for Engineering Research, University of Zaragoza, Zaragoza, Spain;Department of General and Hispanic Linguistics, University of Zaragoza, Zaragoza, Spain;Communications Technology Group, Aragón Institute for Engineering Research, University of Zaragoza, Zaragoza, Spain;Communications Technology Group, Aragón Institute for Engineering Research, University of Zaragoza, Zaragoza, Spain;Communications Technology Group, Aragón Institute for Engineering Research, University of Zaragoza, Zaragoza, Spain
Venue:
EURASIP Journal on Advances in Signal Processing - Special issue on analysis and signal processing of oesophageal and pathological voices
Year:
2009

Citing 2
Cited 0

Elements of information theory

Elements of information theory
Improving dialogue systems in a home automation environment

Proceedings of the 1st international conference on Ambient media and systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work presents the results in the analysis of the acoustic features (formants and the three suprasegmental features: tone, intensity and duration) of the vowel production in a group of 14 young speakers suffering different kinds of speech impairments due to physical and cognitive disorders. A corpus with unimpaired children's speech is used to determine the reference values for these features in speakers without any kind of speech impairment within the same domain of the impaired speakers; this is 57 isolated words. The signal processing to extract the formant and pitch values is based on a Linear Prediction Coefficients (LPCs) analysis of the segments considered as vowels in a Hidden Markov Model (HMM) based Viterbi forced alignment. Intensity and duration are also based in the outcome of the automated segmentation. As main conclusion of the work, it is shown that intelligibility of the vowel production is lowered in impaired speakers even when the vowel is perceived as correct by human labelers. The decrease in intelligibility is due to a 30% of increase in confusability in the formants map, a reduction of 50% in the discriminative power in energy between stressed and unstressed vowels and to a 50% increase of the standard deviation in the length of the vowels. On the other hand, impaired speakers keep good control of tone in the production of stressed and unstressed vowels.