Speech Recognition Based on Feature Extraction with Variable Rate Frequency Sampling

  • Authors:
  • Ilyas Potamitis;Nikos Fakotakis;George K. Kokkinakis

  • Affiliations:
  • -;-;-

  • Venue:
  • TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most feature extraction techniques involve in their primary stage a Discrete Fourier Transform (DFT) of consecutive, short, overlapping windows. The spectral resolution of the DFT representation is uniform and is given by Δf = 2π/N where N is the length of the window The present paper investigates the use of non-uniform rate frequency sampling, varying as a function of the spectral characteristics of each frame, in the context of Automatic Speech Recognition. We are motivated by the non-uniform spectral sensitivity of human hearing and the necessity for a feature extraction technique that auto-focuses on most reliable parts of the spectrum in noisy cases.