Development of syllable-based text to speech synthesis system in Bengali

Authors:
N. P. Narendra;K. Sreenivasa Rao;Krishnendu Ghosh;Ramu Reddy Vempada;Sudhamay Maity
Affiliations:
School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, India 721302;School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, India 721302;School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, India 721302;School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, India 721302;School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, India 721302
Venue:
International Journal of Speech Technology
Year:
2011

Abstract

This paper presents the design and development of an unrestricted text-to-speech (TTS) synthesis system for the Bengali language. An unrestricted TTS system is capable of synthesizing good-quality speech across different domains. In this work, syllables are used as the basic units for synthesis, and the Festival framework is used to build the TTS system. Speech collected from a female artist is used as the speech corpus. Initially, speech is collected from five speakers, and a prototype TTS system is built for each of them. The best speaker among the five is selected through subjective and objective evaluation of natural and synthesized waveforms. The unrestricted TTS system is then developed by addressing the issues that arise at each stage, with the aim of producing a good-quality synthesizer. Evaluation is carried out in four stages through objective tests and subjective listening tests on the synthesized speech. At the first stage, the TTS system is built with the basic Festival framework. In the following stages, additional features are incorporated into the system and the synthesis quality is evaluated. The subjective and objective measures indicate that the proposed features and methods improve the quality of the synthesized speech from stage 2 to stage 4.