Speaker adaptation using multiple reference speakers

Authors:
Francis Kubala;Richard Schwartz;Chris Barry
Affiliations:
BBN Systems and Technologies Corporation, Cambridge, MA;BBN Systems and Technologies Corporation, Cambridge, MA;BBN Systems and Technologies Corporation, Cambridge, MA
Venue:
HLT '89 Proceedings of the workshop on Speech and Natural Language
Year:
1989

Citing 1
Cited 1

Speaker adaptation from limited training in the BBN BYBLOS Speech Recognition system

HLT '89 Proceedings of the workshop on Speech and Natural Language

A study on speaker-adaptive speech recognition

HLT '91 Proceedings of the workshop on Speech and Natural Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce a new technique for using the speech of multiple reference speakers as a basis for speaker adaptation in large vocabulary continuous speech recognition. In contrast to other methods that use a pooled reference model, this technique normalizes the training speech from multiple reference speakers to a single common feature space before pooling it. The normalized and pooled speech can then be treated as if it came from a single reference speaker for training the reference hidden Markov model (HMM). Our usual probabilistic spectrum transformation can be applied to the reference HMM to model a new (target) speaker. In this paper, we describe our baseline (single reference speaker) speaker-adaptation system and give current performance results from a recent formal evaluation of the system. We also describe our proposal for adapting from multiple reference speakers and report on recent preliminary experimental results in support of the proposed technique.