Multilingual speech corpora for TTS system development

Authors:
Hsi-Chun Hsiao;Hsiu-Min Yu;Yih-Ru Wang;Sin-Horng Chen
Affiliations:
Department of Communication Engineering, Chiao Tung University, Hsinchu;Department of Foreign Languages, Chung Hua University, Hsinchu;Department of Communication Engineering, Chiao Tung University, Hsinchu;Department of Communication Engineering, Chiao Tung University, Hsinchu
Venue:
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Year:
2006

Citing 2
Cited 0

Sinica Treebank: design criteria, annotation guidelines, and on-line interface

CLPW '00 Proceedings of the second workshop on Chinese language processing: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 12
A hakka text-to-speech system

ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, four speech corpora collected in the Speech Lab of NCTU in recent years are discussed. They include a Mandarin tree-bank speech corpus, a Min-Nan speech corpus, a Hakka speech corpus, and a Chinese-English mixed speech corpus. Currently, they are used separately to develop a corpus-based Mandarin TTS system, a Min-Nan TTS system, a Hakka TTS system, and a Chinese-English bilingual TTS system. These systems will be integrated in the future to construct a multilingual TTS system covering the four primary languages used in Taiwan.