Quaternion Fourier Descriptors for the Preprocessing and Recognition of Spoken Words Using Images of Spatiotemporal Representations

  • Authors:
  • Eduardo Bayro-Corrochano;Noel Trujillo;Michel Naranjo

  • Affiliations:
  • Electrical Engineering and Computer Science Department, CINVESTAV, Centro de Investigación y de Estudios Avanzados, Guadalajara, Mexico Jal. 44550;GRAVIR/LASMEA, University Blaise Pascal, Clermont-Ferrand, France;GRAVIR/LASMEA, University Blaise Pascal, Clermont-Ferrand, France

  • Venue:
  • Journal of Mathematical Imaging and Vision
  • Year:
  • 2007

Abstract

This paper presents an application of the quaternion Fourier transform to preprocessing for neural computing. In a novel approach, the 1D acoustic signals of French spoken words are represented as 2D signals in the time and frequency domains. These images are then convolved in the quaternion Fourier domain with a quaternion Gabor filter to extract features. This approach greatly reduces the dimension of the feature vector. Two methods of feature extraction are tested. The feature vectors were used to train a simple MLP, a TDNN, and a system of neural experts. The improvement in the classification rate of the neural network classifiers is very encouraging and amply justifies the preprocessing in the quaternion frequency domain. This work also suggests applying the quaternion Fourier transform to other image processing tasks.
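
As an illustration of the first step only (turning a 1D acoustic signal into a 2D time-frequency image), the following is a minimal sketch using a standard short-time Fourier transform. The abstract does not specify how the authors build their spatiotemporal representation, so the function name `spectrogram`, the Hann window, the frame length, the hop size, and the synthetic test tone are all assumptions made for this example. The quaternion Fourier transform and the quaternion Gabor filtering are not shown.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram: rows are frequency bins, columns are time frames.

    Hypothetical helper for illustration; parameters are assumed, not taken
    from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping frames and taper each one
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # rfft keeps the non-negative frequencies of each real-valued frame
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Synthetic stand-in for a spoken word: a 440 Hz tone sampled at 8 kHz
fs = 8000
t = np.arange(fs) / fs
image = spectrogram(np.sin(2 * np.pi * 440 * t))
```

The resulting `image` is a 2D array that could be treated like any grayscale picture and passed to a 2D filtering stage; a real recording would replace the synthetic tone.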