A hakka text-to-speech system

Authors:
Hsiu-Min Yu;Hsin-Te Hwang;Dong-Yi Lin;Sin-Horng Chen
Affiliations:
Department of Foreign Languages, Chung Hua University, Hsinchu;Department of Communication Engineering, National Chiao Tung University, Hsinchu;Department of Communication Engineering, National Chiao Tung University, Hsinchu;Department of Communication Engineering, National Chiao Tung University, Hsinchu
Venue:
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Year:
2006

Citing 1
Cited 1

Neural Networks: A Comprehensive Foundation

Neural Networks: A Comprehensive Foundation

Multilingual speech corpora for TTS system development

ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, the implementation of a Hakka text-to-speech (TTS) system is presented. The system is designed based on the same principle of developing a Mandarin and a Min-Nan TTS systems proposed previously. It takes 671 base-syllables as basic synthesis units and uses a recurrent neural network (RNN)-based prosody generator to generate proper prosodic parameters for synthesizing natural output speech. The whole system is implemented by software and runs in real-time on PC. Informal subjective listening test confirmed that the system performed well. All synthetic speeches sounded well for well-tokenized texts and fair for texts with automatic tokenization.