Multilingual speech corpora for TTS system development

  • Authors: Hsi-Chun Hsiao; Hsiu-Min Yu; Yih-Ru Wang; Sin-Horng Chen

  • Affiliations: Department of Communication Engineering, Chiao Tung University, Hsinchu; Department of Foreign Languages, Chung Hua University, Hsinchu; Department of Communication Engineering, Chiao Tung University, Hsinchu; Department of Communication Engineering, Chiao Tung University, Hsinchu

  • Venue: ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
  • Year: 2006

Abstract

In this paper, four speech corpora collected in the Speech Lab of NCTU in recent years are discussed. They include a Mandarin tree-bank speech corpus, a Min-Nan speech corpus, a Hakka speech corpus, and a Chinese-English mixed speech corpus. Currently, they are used separately to develop a corpus-based Mandarin TTS system, a Min-Nan TTS system, a Hakka TTS system, and a Chinese-English bilingual TTS system. These systems will be integrated in the future to construct a multilingual TTS system covering the four primary languages used in Taiwan.