Context-independent phoneme recognition using a K-Nearest Neighbour classification approach

  • Authors:
  • Ladan Golipour;Douglas O'Shaughnessy

  • Affiliations:
  • INRS-EMT, Quebec University, Montreal, Canada;INRS-EMT, Quebec University, Montreal, Canada

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we investigate a non-parametric classification of English phonemes in speaker-independent continuous speech. We employ the “voting” k-Nearest Neighbour (k-NN) classifier, a powerful technique in pattern recognition problems, along with a new representation of phonemes for the speech recognition task. We also exploit the idea behind “approximate” k-NN that results in a very fast way of computing the k approximate closest neighbours of each data point. Comparing the recognition performance of the proposed method with the HMM-based recognizer of HTK toolkit reveals that the k-NN-based recognizer outperforms its counterpart. In addition, incorporating the “approximate” nearest neighbour search instead of the “exact” one results in completing the training step much faster than the HMM-based system, and the testing step with a comparable computational time. We also reduced the amount of the training data by applying a pattern recognition technique, called “thinning” algorithm. The outcome was a considerable reduction in the k-NN search space and hence the execution time, and also a slight increase in the recognition performance.