The HDM: a segmental hidden dynamic model of coarticulation

Authors:
H. B. Richards; J. S. Bridle
Affiliations:
Dragon Systems UK, Cheltenham, UK
Venue:
ICASSP '99: Proceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 01
Year:
1999


Abstract

This paper introduces a new approach to acoustic-phonetic modelling, the hidden dynamic model (HDM), which explicitly accounts for the coarticulation and transitions between neighbouring phones. Inspired by the fact that speech is really produced by an underlying dynamic system, the HDM consists of a single vector target per phone in a hidden dynamic space in which speech trajectories are produced by a simple dynamic system. The hidden space is mapped to the surface acoustic representation via a non-linear mapping in the form of a multilayer perceptron (MLP). Algorithms are presented for training of all the parameters (target vectors and MLP weights) from segmented and labelled acoustic observations alone, with no special initialisation. The model captures the dynamic structure of speech, and appears to aid a speech recognition task based on the SwitchBoard corpus.
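
The generative structure described in the abstract (one target vector per phone, a simple dynamic system producing a hidden trajectory, and an MLP mapping that trajectory to acoustic frames) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the first-order smoothing rule standing in for the "simple dynamic system", the `rate` constant, the dimensions, the phone labels, the random untrained MLP weights, and all function names. Training, which the abstract says covers both targets and MLP weights, is not shown.

```python
import math
import random


def hidden_trajectory(targets, durations, rate=0.3, start=None):
    """Move a hidden state toward each segment's target in turn.

    Assumed dynamics (not from the paper): a first-order smoother,
    x[t] = x[t-1] + rate * (target - x[t-1]).
    """
    state = list(start) if start is not None else list(targets[0])
    path = []
    for target, n_frames in zip(targets, durations):
        for _ in range(n_frames):
            state = [s + rate * (g - s) for s, g in zip(state, target)]
            path.append(state)
    return path


def mlp(x, w1, b1, w2, b2):
    """One-hidden-layer perceptron: tanh hidden units, linear outputs."""
    hidden_units = [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(w1, b1)
    ]
    return [
        sum(w * h for w, h in zip(row, hidden_units)) + b
        for row, b in zip(w2, b2)
    ]


def random_matrix(rows, cols, rng):
    """Untrained placeholder weights drawn uniformly from [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Hypothetical toy setup: 2-D hidden space, three "phones", 4-D acoustic frames.
rng = random.Random(0)
targets = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [-1.0, -1.0]}
segments = [("a", 10), ("b", 10), ("c", 10)]  # (phone label, number of frames)

w1, b1 = random_matrix(5, 2, rng), [0.0] * 5  # hidden space -> 5 tanh units
w2, b2 = random_matrix(4, 5, rng), [0.0] * 4  # 5 tanh units -> 4-D acoustics

hidden = hidden_trajectory(
    [targets[p] for p, _ in segments],
    [n for _, n in segments],
    start=[0.0, 0.0],
)
acoustic = [mlp(x, w1, b1, w2, b2) for x in hidden]
```

In this toy version the hidden state never jumps between targets. At each segment boundary it starts from wherever the previous segment left it, so the first frames of a phone depend on its predecessor. That carry-over is the sense in which a model of this shape can account for coarticulation, under the assumed dynamics.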