Speaker adaptive phoneme recognition based on feature mapping from spectral domain to probabilistic domain

Authors:
T. Kobayashi;Y. Uchiyama;J. Osada;K. Shirai
Affiliations:
Department of Electrical Engineering, Waseda University, Tokyo, Japan;Department of Electrical Engineering, Hosei University, Tokyo, Japan;Department of Electrical Engineering, Hosei University, Tokyo, Japan;Department of Electrical Engineering, Waseda University, Tokyo, Japan
Venue:
ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
Year:
1992

Abstract

A new feature parameter space for speech recognition, called PRPG (Probability Ratios between Phoneme Group pairs), is proposed, and speaker-adaptive phoneme recognition is performed in it. In the proposed coordinate system, each area carrying the same information for speech recognition is compressed into a single point. The mapping function from the spectral coordinate system to the proposed one is realized using a neural network. Code-vectors designed on this coordinate system are assured to be information-theoretically more efficient than those designed on the spectral coordinate system. Moreover, by the definition of the coordinate system, the meaning of the axes is equivalent across speakers, so speaker adaptation can be performed easily without trajectory mapping. Experimental results show that the coordinate conversion reduces errors by 40% in the speaker-dependent tasks. The scores of the speaker-adaptive tasks in the proposed feature domain are always superior to those of the speaker-dependent tasks in the spectral domain.
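The abstract does not give the exact definition of the PRPG coordinates, so the following is only a rough sketch of what a probability-ratio feature between phoneme group pairs might look like. It assumes that a front end (a neural network in the paper, hand-written numbers here) supplies a posterior probability for each phoneme group, and that one coordinate per group pair is the normalised ratio P(a) / (P(a) + P(b)). The function name `prpg_features`, the group names, and the normalised form of the ratio are all illustrative assumptions, not the authors' formulation.

```python
from itertools import combinations


def prpg_features(group_posteriors, eps=1e-12):
    """Map phoneme-group posteriors to pairwise probability-ratio coordinates.

    group_posteriors: dict mapping a (hypothetical) phoneme-group name to
        P(group | frame), e.g. as estimated by a neural network.
    Returns a dict keyed by (group_a, group_b) whose value is
        P(a) / (P(a) + P(b)), an ASSUMED normalised form of the ratio.
    """
    names = sorted(group_posteriors)  # fixed axis order for every speaker
    features = {}
    for a, b in combinations(names, 2):
        pa, pb = group_posteriors[a], group_posteriors[b]
        # eps guards against division by zero when both posteriors vanish
        features[(a, b)] = pa / (pa + pb + eps)
    return features


# Hypothetical posteriors for a single frame (made-up values)
frame = {"vowel": 0.6, "nasal": 0.3, "fricative": 0.1}
feats = prpg_features(frame)
for pair, value in feats.items():
    print(pair, round(value, 4))
```

Under these assumptions, any two spectral frames that yield the same group posteriors land on the same point in the ratio space, and each axis refers to the same group pair for every speaker, which is the property the abstract relies on for speaker adaptation.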