An HMM-based approach to automatic phrasing for Mandarin text-to-speech synthesis

Authors:
Jing Zhu;Jian-Hua Li
Affiliations:
Shanghai Jiao Tong University;Shanghai Jiao Tong University
Venue:
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Year:
2006

Abstract

Automatic phrasing is essential to Mandarin text-to-speech synthesis. We select word format as the target linguistic feature and propose an HMM-based approach to the problem. Using a discrete hidden Markov model, we define four prosodic-position states for each word. The approach achieves an accuracy of roughly 82%, which is very close to that of manual labeling. Our experimental results also show that it has advantages over part-of-speech-based approaches.
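
As a rough illustration of the idea in the abstract, the sketch below tags each word with one of four prosodic-position states by Viterbi decoding over a discrete HMM, then reads phrase breaks off the tags. Everything specific here is an assumption made for the example and is not taken from the paper: the state inventory (B = phrase-initial, M = phrase-medial, E = phrase-final, S = single-word phrase), the use of word length in characters (capped at 3) as the observed "word format" symbol, and all probability values, which are invented. In a real system the parameters would be estimated from a prosodically labeled corpus.

```python
import math

# Hypothetical state inventory; the paper's actual four states may differ.
STATES = ["B", "M", "E", "S"]

# Invented, illustrative parameters (NOT values from the paper).
START = {"B": 0.7, "M": 0.0, "E": 0.0, "S": 0.3}
TRANS = {
    "B": {"B": 0.0, "M": 0.4, "E": 0.6, "S": 0.0},
    "M": {"B": 0.0, "M": 0.3, "E": 0.7, "S": 0.0},
    "E": {"B": 0.7, "M": 0.0, "E": 0.0, "S": 0.3},
    "S": {"B": 0.7, "M": 0.0, "E": 0.0, "S": 0.3},
}
# Assumed observation symbols: word length in characters, capped at 3.
EMIT = {
    "B": {1: 0.3, 2: 0.6, 3: 0.1},
    "M": {1: 0.5, 2: 0.4, 3: 0.1},
    "E": {1: 0.2, 2: 0.6, 3: 0.2},
    "S": {1: 0.1, 2: 0.3, 3: 0.6},
}


def _log(p):
    """Log-probability that maps zero to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(observations):
    """Return the most likely state sequence for a list of observation symbols."""
    if not observations:
        return []
    # Each trellis column maps state -> (best log-score, backpointer).
    first = observations[0]
    trellis = [{s: (_log(START[s]) + _log(EMIT[s][first]), None) for s in STATES}]
    for obs in observations[1:]:
        prev = trellis[-1]
        column = {}
        for s in STATES:
            best_prev = max(STATES, key=lambda p: prev[p][0] + _log(TRANS[p][s]))
            score = prev[best_prev][0] + _log(TRANS[best_prev][s]) + _log(EMIT[s][obs])
            column[s] = (score, best_prev)
        trellis.append(column)
    # A sentence must end on a phrase-final word, so only E or S may close the path.
    state = max(("E", "S"), key=lambda s: trellis[-1][s][0])
    path = [state]
    for column in reversed(trellis[1:]):
        state = column[state][1]
        path.append(state)
    path.reverse()
    return path


def phrase_breaks(path):
    """Indices of words that are followed by a phrase break (tagged E or S)."""
    return [i for i, s in enumerate(path) if s in ("E", "S")]
```

Calling `phrase_breaks(viterbi(lengths))` on a list of word lengths for a segmented sentence returns the positions after which a break would be placed under these toy parameters. Working in log space avoids numerical underflow on long sentences, and the zero entries in the transition table rule out ill-formed sequences such as a phrase-medial word directly after a phrase-final one.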