Phoneme HMMs constrained by frame correlations

  • Authors:
  • Satoshi Takahashi;Tatsuo Matsuoka;Yasuhiro Minami;Kiyohiro Shikano

  • Affiliations:
  • NTT Human Interface Laboratories, Musashino-Shi, Tokyo, Japan;NTT Human Interface Laboratories, Musashino-Shi, Tokyo, Japan;NTT Human Interface Laboratories, Musashino-Shi, Tokyo, Japan;NTT Human Interface Laboratories, Musashino-Shi, Tokyo, Japan

  • Venue:
  • ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes new Hidden Markov Models (HMMs) that use correlations between two frames to constrain the feature distributions to the region that is appropriate for an input speaker. This makes it possible to reduce the overlapping of feature distributions between different phonemes. In ICASSP92, we proposed the bigram-constrained HMM based on the combination of the discrete speaker-independent HMM and the VQ-code bigram, and showed that it performed better than a conventional speaker-independent HMM. In this paper, tied-mixture HMMs are adopted to create the tied-mixture type bigram-constrained HMM to obtain better recognition performance. Furthermore, the strategy is extended to the continuous HMM. These three types of HMMs are formulated and evaluated by phoneme recognition in continuous speech.