Objective evaluation of speech dysfluencies using wavelet packet transform with sample entropy

Authors:
M. Hariharan;C. Y. Fook;R. Sindhu;Abdul Hamid Adom;Sazali Yaacob
Affiliations:
School of Mechatronic Engineering, University Malaysia Perlis (UniMAP), 02600, Campus Pauh Putra, Perlis, Malaysia;School of Mechatronic Engineering, University Malaysia Perlis (UniMAP), 02600, Campus Pauh Putra, Perlis, Malaysia;School of Microelectronic Engineering, University Malaysia Perlis (UniMAP), 02600, Campus Pauh Putra, Perlis, Malaysia;School of Mechatronic Engineering, University Malaysia Perlis (UniMAP), 02600, Campus Pauh Putra, Perlis, Malaysia;School of Mechatronic Engineering, University Malaysia Perlis (UniMAP), 02600, Campus Pauh Putra, Perlis, Malaysia
Venue:
Digital Signal Processing
Year:
2013

Citing 14
Cited 0

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Intelligent Processing of Stuttered Speech

Journal of Intelligent Information Systems
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
A novel approach for digital radio signal classification: Wavelet packet energy-multiclass support vector machine (WPE-MSVM)

Expert Systems with Applications: An International Journal
Speaker identification using discrete wavelet packet transform technique with irregular decomposition

Expert Systems with Applications: An International Journal
An expert system for fault diagnosis in internal combustion engines using wavelet packet transform and neural network

Expert Systems with Applications: An International Journal
An intelligent fault diagnosis method based on wavelet packer analysis and hybrid support vector machines

Expert Systems with Applications: An International Journal
Optimal parameters study for sample entropy-based atrial fibrillation organization analysis

Computer Methods and Programs in Biomedicine
Classification of speech dysfluencies with MFCC and LPCC features

Expert Systems with Applications: An International Journal
Optimized orthonormal wavelet filters with improved frequency separation

Digital Signal Processing
Recovery of the optimal approximation from samples in wavelet subspace

Digital Signal Processing
Performance comparison of wavelet based and conventional OFDM systems in multipath Rayleigh fading channels

Digital Signal Processing
Pathological infant cry analysis using wavelet packet transform and probabilistic neural network

Expert Systems with Applications: An International Journal
Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques

Journal of Medical Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Dysfluency and stuttering are a break or interruption of normal speech such as repetition, prolongation, interjection of syllables, sounds, words or phrases and involuntary silent pauses or blocks in communication. Stuttering assessment through manual classification of speech dysfluencies is subjective, inconsistent, time consuming and prone to error. This paper proposes an objective evaluation of speech dysfluencies based on the wavelet packet transform with sample entropy features. Dysfluent speech signals are decomposed into six levels by using wavelet packet transform. Sample entropy (SampEn) features are extracted at every level of decomposition and they are used as features to characterize the speech dysfluencies (stuttered events). Three different classifiers such as k-nearest neighbor (kNN), linear discriminant analysis (LDA) based classifier and support vector machine (SVM) are used to investigate the performance of the sample entropy features for the classification of speech dysfluencies. 10-fold cross validation method is used for testing the reliability of the classifier results. The effect of different wavelet families on the classification performance is also performed. Experimental results demonstrate that the proposed features and classification algorithms give very promising classification accuracy of 96.67% with the standard deviation of 0.37 and also that the proposed method can be used to help speech language pathologist in classifying speech dysfluencies.