Speaker adaptive phoneme recognition based on feature mapping from spectral domain to probabilistic domain

  • Authors:
  • T. Kobayashi; Y. Uchiyama; J. Osada; K. Shirai

  • Affiliations:
  • Department of Electrical Engineering, Waseda University, Tokyo, Japan; Department of Electrical Engineering, Hosei University, Tokyo, Japan; Department of Electrical Engineering, Hosei University, Tokyo, Japan; Department of Electrical Engineering, Waseda University, Tokyo, Japan

  • Venue:
  • ICASSP'92: Proceedings of the 1992 IEEE International Conference on Acoustics, Speech and Signal Processing - Volume 1
  • Year:
  • 1992

Abstract

This paper proposes a new feature parameter space for speech recognition, called PRPG (Probability Ratios between Phoneme Group pairs), and applies it to speaker-adaptive phoneme recognition. In the proposed coordinate system, any region carrying the same information for speech recognition is compressed into a single point. The mapping function from the spectral coordinate system to the proposed one is realized with a neural network. Code vectors designed in this coordinate system are assured to be information-theoretically more efficient than those designed in the spectral coordinate system. Moreover, by the definition of the coordinate system, the axes have the same meaning across speakers, so speaker adaptation can be performed easily without trajectory mapping. Experimental results show that the coordinate conversion reduces errors by 40% in speaker-dependent tasks. The scores of the speaker-adaptive tasks in the proposed feature domain are always superior to those of the speaker-dependent tasks in the spectral domain.
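
The abstract does not give the exact definition of the PRPG coordinates, so the following is only a minimal sketch of one plausible reading: each axis is the normalized probability ratio P(Gi|x) / (P(Gi|x) + P(Gj|x)) for a pair of phoneme groups (Gi, Gj). That ratio form, the function name `prpg_features`, and the `eps` parameter are assumptions made for illustration and are not taken from the paper. In the paper the mapping from spectral features is realized by a neural network; here the per-frame group probabilities are simply taken as given input.

```python
import numpy as np
from itertools import combinations


def prpg_features(group_probs, pairs=None, eps=1e-12):
    """Map per-frame phoneme-group probabilities to pairwise ratio features.

    group_probs: array-like of shape (n_frames, n_groups), non-negative.
    pairs: optional list of (i, j) group-index pairs; defaults to all i < j.
    Returns an array of shape (n_frames, n_pairs) with values in [0, 1].

    Assumed (hypothetical) axis definition: p_i / (p_i + p_j).
    """
    p = np.asarray(group_probs, dtype=float)
    if pairs is None:
        pairs = list(combinations(range(p.shape[1]), 2))
    i = np.array([a for a, _ in pairs])
    j = np.array([b for _, b in pairs])
    # eps guards against division by zero when both probabilities vanish.
    return p[:, i] / (p[:, i] + p[:, j] + eps)


# Two frames, three hypothetical phoneme groups.
probs = np.array([[0.6, 0.3, 0.1],
                  [0.2, 0.2, 0.6]])
feats = prpg_features(probs)  # columns correspond to pairs (0,1), (0,2), (1,2)
```

Under this assumed definition, any two spectral frames that yield the same group probabilities land on the same point, and each axis refers to a fixed pair of phoneme groups regardless of the speaker, which is the property the abstract relies on for adaptation without trajectory mapping.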