The acoustic-modeling problem in automatic speech recognition
The acoustic-modeling problem in automatic speech recognition
Hi-index | 0.00 |
In this paper, we are looking into the adaptation issues of vocabulary independent (VI) systems. Just as with speaker-adaptation in speaker-independent system, two vocabulary learning algorithms are implemented in order to tailor the VI subword models to the target vocabulary. The first algorithm is to generate vocabularyadapted clustering decision trees by focusing on relevant allophones during tree generation and reduces the VI error rate by 9%. The second algorithm, vocabulary-bias training, is to give the relevant allophones more prominence by assign more weight to them during Baum-Welch training of the generalized allophonic models and reduces the VI error rate by 15%. Finally, in order to overcome the degradation causing by the different acoustic environments used for VI training and testing, CDCN and ISDCN originally designed for microphone adaptation are incorporated into our VI system and both reduce the degradation of VI cross-environment recognition by 50%.