Adaptive hierarchy of hidden Markov models for transformation-based adaptation

Authors:
Jen-Tzung Chien
Affiliations:
Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, Taiwan, ROC
Venue:
Speech Communication
Year:
2002

Citing 5
Cited 0

Telephone speech recognition based on Bayesian adaptation of hidden Markov models

Speech Communication
Unsupervised hierarchical adaptation using reliable selection of cluster-dependent parameters

Speech Communication
An experimental study of acoustic adaptation algorithms

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Speaker adaptation with autonomous model complexity control by MDL principle

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Irrelevant variability normalization in learning HMM state tying from data based on phonetic decision-tree

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 02

Quantified Score

Hi-index	0.00

Visualization

Abstract

Transformation-based adaptation, which transforms clusters of speaker-independent (SI) hidden Markov model (HMM) parameters to an enrolled speaker by using cluster-dependent transformation functions, is an effective algorithm for robust speech recognition. To obtain desirable performance for any amount of adaptation data, it is beneficial to establish a tree structure of HMM parameters and apply it to dynamically control the sharing of transformation parameters. Traditionally, the transformation sharing is determined by phonetic rules or by clustering the acoustic space of training data. The tree structure is then kept unchanged for speaker adaptation (SA). In this paper, we adapt the tree structure to new environment such that the transformation parameters can be extracted adaptively by referring to the newest hierarchy of HMM parameters. The adaptation of hierarchical tree is herein combined into the maximum likelihood (ML) estimation of transformation parameters. From a series of speaker adaptation experiments, we find that the transformation-based adaptation with adaptive hierarchy of HMM parameters outperforms that with the static hierarchy for different cases of tree depths and adaptation data lengths.