Towards increasing speech recognition error rates
Speech Communication
Discrete Time Processing of Speech Signals
Most feature extraction techniques begin with a Discrete Fourier Transform (DFT) of consecutive, short, overlapping windows. The spectral resolution of the DFT representation is uniform and is given by Δf = 2π/N (in radians per sample), where N is the length of the window. The present paper investigates the use of non-uniform frequency sampling, with a sampling rate that varies as a function of the spectral characteristics of each frame, in the context of Automatic Speech Recognition. We are motivated by the non-uniform spectral sensitivity of human hearing and by the need for a feature extraction technique that automatically focuses on the most reliable parts of the spectrum in noisy conditions.
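To make the contrast between uniform and non-uniform frequency sampling concrete, the following is a minimal Python sketch and not the method of the paper. It evaluates the spectrum of one Hamming-windowed frame on the uniform DFT grid (spacing 2π/N) and on a hypothetical non-uniform grid that places half of its points in a narrow band. The helper names (`dtft_at`, `uniform_grid`, `focused_grid`), the test tone at 0.9 rad/sample, and the fixed band [0.8, 1.0) are all assumptions made for illustration; the abstract does not specify how the sampling is adapted to each frame.

```python
import cmath
import math


def dtft_at(frame, omegas):
    """Evaluate the spectrum of a finite frame at arbitrary angular
    frequencies (rad/sample) by direct summation."""
    return [
        sum(x * cmath.exp(-1j * w * n) for n, x in enumerate(frame))
        for w in omegas
    ]


def uniform_grid(n_points):
    """DFT grid: n_points frequencies spaced 2*pi/n_points apart on [0, 2*pi)."""
    return [2.0 * math.pi * k / n_points for k in range(n_points)]


def focused_grid(n_points, lo, hi, share=0.5):
    """Hypothetical non-uniform grid on [0, pi): a fraction `share` of the
    points is packed into [lo, hi), the rest is spread evenly over [0, pi)."""
    n_dense = int(n_points * share)
    n_coarse = n_points - n_dense
    dense = [lo + (hi - lo) * k / n_dense for k in range(n_dense)]
    coarse = [math.pi * k / n_coarse for k in range(n_coarse)]
    return sorted(dense + coarse)


def hamming(n_points):
    """Symmetric Hamming window of length n_points."""
    return [
        0.54 - 0.46 * math.cos(2.0 * math.pi * n / (n_points - 1))
        for n in range(n_points)
    ]


def peak_frequency(frame, omegas):
    """Return the grid frequency with the largest spectral magnitude."""
    magnitudes = [abs(v) for v in dtft_at(frame, omegas)]
    return omegas[magnitudes.index(max(magnitudes))]


N = 64          # window length (assumed value)
omega0 = 0.9    # frequency of the illustrative test tone, rad/sample
frame = [w * math.cos(omega0 * n) for n, w in enumerate(hamming(N))]

uniform = uniform_grid(N)
delta_f = uniform[1] - uniform[0]            # uniform resolution, 2*pi/N

half_band = [w for w in uniform if w < math.pi]   # N/2 points on [0, pi)
focused = focused_grid(len(half_band), 0.8, 1.0)  # same number of points
dense_step = (1.0 - 0.8) / (len(half_band) // 2)  # spacing inside the band

peak_uniform = peak_frequency(frame, half_band)
peak_focused = peak_frequency(frame, focused)

print(f"uniform spacing      : {delta_f:.4f} rad/sample")
print(f"dense-band spacing   : {dense_step:.4f} rad/sample")
print(f"peak on uniform grid : {peak_uniform:.4f}")
print(f"peak on focused grid : {peak_focused:.4f}")
```

With the same number of frequency samples, the focused grid resolves the band of interest more finely than 2π/N at the cost of coarser sampling elsewhere. A frame-adaptive scheme would choose the dense region from the signal itself rather than from a fixed band.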