International Journal of Speech Technology
This paper presents the design and development of an unrestricted text-to-speech (TTS) synthesis system for the Bengali language. An unrestricted TTS system can synthesize good-quality speech across different domains. In this work, syllables are used as the basic units for synthesis, and the Festival framework is used to build the TTS system. Speech collected from a female artist is used as the speech corpus. Initially, speech is collected from five speakers and a prototype TTS system is built for each speaker. The best of the five speakers is selected through subjective and objective evaluation of natural and synthesized waveforms. The unrestricted TTS system is then developed by addressing the issues that arise at each stage, in order to produce a good-quality synthesizer. Evaluation is carried out in four stages, using objective measures and subjective listening tests on the synthesized speech. In the first stage, the TTS system is built with the basic Festival framework. In the following stages, additional features are incorporated into the system and the quality of synthesis is evaluated. The subjective and objective measures indicate that the proposed features and methods improve the quality of the synthesized speech from stage 2 to stage 4.