Speech Recognition Based on Feature Extraction with Variable Rate Frequency Sampling

Authors:
Ilyas Potamitis;Nikos Fakotakis;George K. Kokkinakis
Affiliations:
-;-;-
Venue:
TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Year:
2001

Citing 2
Cited 0

Towards increasing speech recognition error rates

Speech Communication
Discrete Time Processing of Speech Signals

Discrete Time Processing of Speech Signals

Quantified Score

Hi-index	0.00

Visualization

Abstract

Most feature extraction techniques involve in their primary stage a Discrete Fourier Transform (DFT) of consecutive, short, overlapping windows. The spectral resolution of the DFT representation is uniform and is given by Δf = 2π/N where N is the length of the window The present paper investigates the use of non-uniform rate frequency sampling, varying as a function of the spectral characteristics of each frame, in the context of Automatic Speech Recognition. We are motivated by the non-uniform spectral sensitivity of human hearing and the necessity for a feature extraction technique that auto-focuses on most reliable parts of the spectrum in noisy cases.