Disordered speech assessment using automatic methods based on quantitative measures
EURASIP Journal on Applied Signal Processing
Comparing speaker-dependent and speaker-adaptive acoustic models for recognizing dysarthric speech
Proceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility
PEAKS - A system for the automatic evaluation of voice and speech disorders
Speech Communication
A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech
IEEE Transactions on Audio, Speech, and Language Processing - Special issue on processing reverberant speech: methodologies and applications
Computer Speech and Language
Hi-index | 0.00 |
Objective measurement of dysarthric speech intelligibility can assist clinicians in the diagnosis of speech disorder severity as well as in the evaluation of dysarthria treatments. In this paper, several objective measures are proposed and tested as correlates of subjective intelligibility. More specifically, the kurtosis of the linear prediction residual is proposed as a measure of vocal source excitation oddity. Additionally, temporal perturbations resultant from imprecise articulation and atypical speech rates are characterized by short- and long-term temporal dynamics measures, which in turn, are based on log-energy dynamics and on an auditory-inspired modulation spectral signal representation, respectively. Motivated by recent insights in the communication disorders literature, a composite measure is developed based on linearly combining a salient subset of the proposed measures with conventional prosodic parameters. Experiments with the publicly-available 'Universal Access' database of spastic dysarthric speech (10 patient speakers; 300 words spoken in isolation, per speaker) show that the proposed composite measure can achieve correlation with subjective intelligibility ratings as high as 0.97; thus the measure can serve as an accurate indicator of dysarthric speech intelligibility.